n 

ON 

: o 



to; 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 
UTILITY PATENT APPLICATION TRANSMITTAL 
UNDER 37 CFR 1.53(b) 






= Cfl 

I Address to: 


Attorney Docket No. 


STL000023US1 


D M 

H 




5 Box Patent Application 


Inventor(s) 


William J. BaeretaL, 






Assistant Commissioner for Patents 


Express Mail Label No. 


EJ495233834US 


CM~^ - 

*±XX* 




Washington, DC 20231 


Filing Date 


January 2 1,2000 


0 
■f—l 



S 0 



2»te of Application: PROVIDING A FUNCTIONAL LAYER FOR FACILITATING CREATION AND 

MANIPULATION OF COMPILATIONS OF CONTENT 



63 
36 



Transmitted with the patent application are the following: 

1 5 1 Page(s) Specification, Claims and Abstract 
Appendix 
Informal drawings 
Declaration and Power of Attorney. 

Assignment of the Invention to International Business Machines Corporation 

Information Disclosure Statement (IDS/PTO 1449) (copies of citations not included in number of pages) 
Copies of IDS citations 
Return Receipt Postcard (MPEP 503). 



Pages 
Page(s) 
Page(s) 
Page(s) 
Page(s) 



J This application is a:/ 

5 _ Continuation _ Divisional _ Continuation-in-Part 

M3 p Y i or application information: Examiner: , 

O Fee Calculation: 



of prior application Serial No. 

Group Art Unit: 





Claims 




Extra 


Rate 


Fees 


Basic Fee 










$690.00 


Total Claims 


15 


-20 = 




x $18,00 




Independent Claims 


3 


-3 = 




x $78.00 




Multiple Dependent Claims 








+$260.00 










TOTAL 


$690.00 



X Please charge my Deposit Account No. 09-0460 in the amount of $690.00. A duplicate 
X The Commissioner is hereby authorized to charge payment of the following fees associated 

overpayment to Deposit Accoun t 09-0460 . A duplicate copy of this sheet is attached. 
X Any filing fees under 37 CFR 1.16 for the presentation of extra claims. 
X Any patent application processing fees under 37 CFR 1.17. 



EXPRESS MAIL CERTIFICATE 

I hereby certify that the above paper/fee is being deposited 
with the United States Postal Service "Express Mail Post Office to Addressee" 
service under 37 CFR 1 . 1 0 on the date indicated below and is addressed 
to the Assistant Commissioner for Patents, Washington, DC 20231. 



copy of this sheet is attached. 

with this communication or credit any 



Respectfully submitted, 
William J. Baer, et al, 



^^^^ 



Date of Deposit: January 21, 2000 

Person Mailing paper/fee: Je^tte 
Signature: 



[ailiftg paper/fee: Jeaaette &rrvl)urbin 




IngridkM. Foerster, Esq., 
Registration No. # 36,5 1 1 
Attorney for Applicants 
Telephone (408) 463-5675 
IBM Corporation 
Intellectual Property Law 
555 Bailey Avenue (J46/G475) 
San Jose, CA 95141-1003 



PRdviDING A FUNCTIONAL LAYER FOR FACILITATING 
CREATION AND MANIPULATION OF COMPILATIONS OF CONTENT 



5 CROSS REFERENCE TO RELATED APPLICATIONS 

This application is related to the co-pending and commonly assigned patent applications 
listed below, which were filed herewith on January 21, 2000 and are all incorporated by reference 
herein: 

10 

A System and Method for Creating Compilations of Content 

Serial No. _/ , ( Our reference Docket # STL000012US1) 

Method and System for Adding Content to a Content Object Stored in a Data Repository 

15 Serial No. / . (Our reference Docket # STL00001 3US1 ) 

Method and System for Adding User-Provided Content to a Content Object Stored in a Data 
=** Repository 

t Serial No. __/ , (Our reference Docket # STL00001 4US1) 

feo 

j: Method and System for Moving Content in a Content Object Stored in a Data Repository 

^ Serial No. _/_, (Our reference Docket # STL000015US1) 

jy 

T 1 Method and System for Removing Content in a Content Object Stored in a Data Repository 

225 Serial No. _/ , (Our reference Docket # STL000016US 1) 

Prerequisite Checking in a System for Creating Compilations of Content 

3 Serial No. / . (Our reference Docket # STL000017US1) 

l80 Method and System forPreventing Mutually Exclusive Content Entities Stored in a Data Repository 

;: ;f Serial No. _/ (Our reference Docket # STL000018US1) 

Z Volume Management Method and System for a Compilation of Content 

" J Serial No. / . (Our reference Docket # STL000019US1) 

35 

Method and System for Calculating Cost of a Compilation of Content 

Serial No. _/ , (Our reference Docket # STL000020US1) 

Method and System for Storing Hierarchical Content Objects in a Data Repository 

40 Serial No. _/ , (Our reference Docket # STL000021US1) 

File Structure for Storing Content Objects in a Data Repository 

Serial No. / . (Our reference Docket # STL000022US1) 

45 A Hitmask for Querying Hierarchically Related Content Entities 

Serial No. _/ , (Our reference Docket # STL990182US1) 

A Method and Configurable Model for Storing Hierarchical Data in a Non-Hierarchical Data 
Repository 

50 Serial No. _/_, (Our reference Docket # STL000025US1) 



STL000023US1 



-1- 



BACKGROUND OF THE INVENTION 



Field of the Invention 

5 The present invention relates to content management, and more specifically, to a system, 

method and program product for creating compilations of content from hierarchical content 
stored in a data repository. 

Background of the Invention 

10 

Content management systems have enabled content of all types, e.g., text, still images, 
C 3 moving images, and audio content, to be stored digitally. Content management systems include, 
rg for example, relational databases, digital libraries, and media servers. They have further 

provided functions for manipulating the content, e.g., searching and editing capabilities. 
Ql5 It would be desirable to enable a user to take advantage of vast stores of content to create 

T compilations tailored to the user's needs or desires. For example, a university professor would 
jhf find value in creating custom textbook tailored to a specific course from prepublished textbooks 
Hi stored in a content management system This compilation could be further enhanced to include 
fj associated multimedia materials. As another example, a music lover would benefit from a 
:sJ 20 system that allows him to specify musical selections to be included in a custom album. Such 

systems would have to partition large content objects (e.g., albums, books, videos) into smaller, 

selectable objects (e.g., musical selection, chapter section, episode) for inclusion in a 

compilation. 

25 SUMMARY OF THE INVENTION 

A web-based system, method and program product are provided for creating a 
compilation of content stored in a data repository as a group of hierarchically related content 
entities, managing, displaying, and searching the content, then creating and exporting 
30 compilations of content for publication. Also provided are a system, data structure, method, and 
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program product for storing content into a repository for use in creating a compilation of content. 

The content is hierarchical in nature. Accordingly, entities at each level of the hierarchy 
except the lowest are defined by "containers". For example, in the case of textual content, the 
hierarchical structure of the data may include book containers, volume containers, chapter 
5 containers, and subsections (noncontainers, because they are at the leaf level of the hierarchy). In 
the case of audio content, the hierarchical containers may be album, compact disk, and musical 
selection, and excerpts of the musical selections are defined as noncontainers. In the case of 
video content, the hierarchical containers may include movies and excerpts from each movie, and 
frames are defined as noncontainers. If desired, the maximum size of a container may specified. 
10 For example, the volume size in a custom book is preferably determined using a threshold value 
defining maximum amount of content allowable for that container, and a procedure is provided 
:fl for managing content entities and containers to maintain this maximum. 

Ifj The hierarchical data and associated metadata are preferably stored in a digital library that 

:f includes search support. A web-based user interface is provided for presenting a user with a 

Gt 5 plurality of selectable objects, each object representing a subset of the hierarchical data (e.g., 

IH 

" " chapter subsections, musical excerpts, video excerpts, etc.). The plurality of objects may 
P represent all subsets of the stored content or less than all of the subsets (e.g., categorizing the 
III content and by providing a bookshelf for each category that a user may browse). The user then 
h selects one or more of the objects for inclusion in a compilation (e.g., a custom textbook). 
C 20 Alternatively, the user may search the content by specifying search criteria through the interface. 
Additionally, the user may create new content, e.g., a new chapter or section, for inclusion in the 
final compilation by inputting user-provided material through the web interface. The system 
preferably stores the new content and creates a reusable, selectable object associated with the 
new content. 

25 Each noncontainer content object is preferably stored as a separate entity in the data 

repository. Each content entity is also stored as a row in a digital library index class as a 
collection of attributes and references to related content entities. Each containter and 
noncontainer is associated with a unique identifier that preferably includes hierarchical 
information about its position in the hierarchy. 
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As the user selects desired objects for inclusion in a compilation, the system arranges the 
objects hierarchically, e.g., into volumes, chapters and sections according to the order specified 
by the user. The system then creates a file object (e.g., a CBO) defining the compilation that 
contains a list or outline of the content entities selected , their identifiers, order and structure. 
5 This file object is stored separately in the data repository. 

The list or outline is presented to the user at the web interface as a table of contents, and 
may be edited through the interface. For example, the user may add content, delete content, or 
move content within and across containers. Editing the list or outline redefines the structure of 
the compilation. Once the user is satisfied with the organization of the compilation, it is 
10 submitted it for publication. The submitted compilation is then forwarded to an approval process 
and is accepted, rejected, or returned to the user with editorial comments appended by the editor. 
^ An aspect of the invention is the calculation of the compilation's cost by estimating the 

5 amount of content it contains and determining a content cost based upon the content estimate. 
:£ Optionally, a cost is assigned to each content entity in the data repository and these actual costs 
Pi 5 are summed as part of the cost estimation procedure. 

: Another aspect of the invention is to provide permission checking. Occasionally, it may 

j f be desired to prevent certain content entities from appearing a same compilation as other content 
it! entities. For example, an author may specify that his work can not be published in the same 
q compilation as the work of another author. Permission checking first requires associating each 
u 20 container and noncontainer with any mutually exclusive containers or noncontainers. For 

example, such association may be achieved by defining a set of rules specifying containers and/or 
content entities that are mutually exclusive. Upon selection of a container or noncontainer to add 
to the compilation, the permission checking procedure determines if the container or 
noncontainer is mutually exclusive of any other containers or content objects, e.g., by consulting 
25 the rules. If so, the permission checking procedure then analyzes the compilation outline to 

determine whether any of the other mutually exclusive containers or noncontainers already exists 
in the compilation. If so, then the selected container or noncontianer is not added to the 
compilation and the user is notified that the content selected may not be included in the 
compilation. Otherwise, the content is added. 
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A further aspect of the invention is to provide prerequisite checking, wherein some 
entities are associated, e.g., by a set of rules, with content objects that are prerequisites to that 
object (e.g., front or backmatter associated with the subsection such as an introduction, appendix, 
or bibliography), and wherein selection by the user of an entity prerequisites causes automatic 
inclusion of all associated prerequisite objects in the final compilation. 

Another aspect of the invention is the provision of a functional layer between the user 
interface and data repository for facilitating the creation, manipulation, storage and management 
of content objects in the data repository. 

Another aspect of the invention allows a user to create multiple compilations 
concurrently. Yet another aspect of the invention allows a user to modify a compilation by 
creating a clone or copy of the compilation and applying user-specified changes to the copy (e.g., 
in the creation of a new edition or version of an existing work.) 

Other aspects of the invention include a configurable model for storing hierarchically 
related data in a relational database, and a data structure for storing the data and associated 
metadata, whereby the hierarchical relationship of the data is preserved. 

As a further aspect of the invention, queries are executed on the hierarchical containers 
and noncontainers through an application or user-interface. The results of the independent 
searches are merged using hit masks. A hit mask is a string of bits, each bit representing a query. 
For each container and noncontainer in the result set, a hit mask is generated and ones of the bits 
are set to indicate which of the queries the container or noncontainer satisfies. Container hit 
masks are OR-ed with their child containers and/or noncontainers to reflect inheritance. 
Containers and noncontainers with all bits set comprise the merged result set. 

DESCRIPTION OF THE DRAWING 

Fig. 1 is a block diagram representing the content management system according to the present 
invention; 

Fig. 2 is a block diagram representing the content input path of the present invention; 
Fig. 3 is a block diagram representing a digital library suitable for practicing the present 
invention; 

Fig. 4 graphically represents the structures for storing content parts in a digital library; 
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Fig. 5 graphically represents the index classes used in storing content in a digital library; 
Fig. 6 is a block diagram representing the path for creating and submitting compilations of 
content according to the present invention; 

Fig. 7 represents parts of a compilation of content stored in the digital library; 

Figs. 8A - 21B represent the interface of an embodiment of the present invention; 

Figs. 22A - 22D represent the system administrator interface of an embodiment of the present 

invention; 

Fig. 23 is a block diagram representing the path for approving and publishing compilations of 
content; and 

Fig. 25 is a state diagram representing the states of a user, request and CBO at various stages of 
the process for creating compilations of content. 

DETAILED DESCRIPTION 
I. System Overview 

Figure 1 functionally depicts a system for creating compilations of content. It comprises 
three parts: a path for inputting content to the data repository (Fig. 2), a path for enabling a user 
to select content and organization from the data repository through a web-based interface for 
inclusion in a compilation of content (Fig. 3), and a path that interfaces with a publishing system 
for creating the compilation of content from the user's specification (Fig. 2). Each path will be 
described in detail below. 

The present invention will now be described in terms of a specific embodiment for 
creating custom textbooks. The intended user group comprises university professors, for 
example. The content stored in the system comprises a plurality of published textbooks, broken 
down into hierarchically related objects: book, volume, chapter and chapter subsection. 

Using the proposed system in this context, a university professor is able to access content 
from a collection of textbooks stored in a digital library and select books, volumes, chapters 
and/or chapter subsections for inclusion in a custom textbook, and is further able to create 
content objects for inclusion in the final work. 
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Although the specific embodiment is provided to facilitate the reader's understanding, it 
will be understood that present invention is of a much broader scope and may be applied in the 
creation of compilations of all types of content including text, image, audio and video content. 

5 A. Receiving and Storing Content 

In the exemplary embodiment of the invention, content and other information is input to 
digital library 20 through the input data path shown in Fig. 2. Briefly, the content and other 
information is input by a user in at an input interface represented by block 8. In the preferred 
embodiment, the input content is provided in SGML format, although other formats may be 
10 supported if desired. The content is forwarded by input application 8 to a converter 10 for 
conversion into the format expected by data loader 14. 
=fl After reformatting, converter 10 outputs the reformatted content and other information to 

5 a loader application 14. Loader 14 receives and maps the data for storage in the data repository 
JS according to a configuration model 12. According to the present example, the data repository is a 

0L5 digital library 20, and the configuration model 12 is specific to the IBMDB2(R) Digital Library 

\ fii 

I* " data storage model. Loader 14 interfaces with the digital library 20 through the digital library 

client application 16. Using the configuration model 12, the content loader 14 is able to map the 
ill content and other information it receives in a manner appropriate for the structure of the 
3*i underlying digital library 20. However, the loader 14 of the present invention may be 
y 20 reconfigured for other types of data repositories by defining a configuration model 12 for each 
data repository used. Thus if the data repository type is later changed, the configuration file 12 
can be updated to reconfigure the input path without having to reprogram the loader application 
14. 

The elements of the input path will now be described in greater detail. 

25 

1. Digital Library 

Examples of digital libraries suitable for use in the present invention are described in 
commonly owned U.S. Patent Nos. 5,787,413 entitled "C++ classes for a digital library" issued 
to Kauffman et al., and 5,857,203 entitled "Method and apparatus for dividing, mapping and 
30 storing large digital objects in a client/server library system" also issued to Kauffman et al. 
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In the preferred embodiment of the present invention, the data repository comprises the 
commercially available IBM DB2 Digital Library. However, other commercially available data 
repositories may be used either in combination with, or in lieu of, the DB2 Digital Library 

Digital libraries are used to store and manage a wide variety of digital objects such as 
5 documents, graphics, audio, video, spread sheets and word-processing text. A conceptual view 
of a conventional digital library client/server system is shown in Fig. 3 and includes a library 
server 44, one or more object servers 48 and a library client 42. Each of the library and object 
servers and the library client includes an information store. That is, the library server 44 
includes a library catalog 46, the is object server 48 includes an object store 50 and the library 
10 client 42 includes a client cache 40. The client applications interface to the digital library through 
an object-oriented API 16. Also, a communications isolator (not shown) is included which 
allows the library server 44, object server 48 and library client 42 to communicate with one 
another without concern for complex communications protocols. 

The library server, object servers and library clients are connected by a communications 

1315 network, such as a wide-area network (WAN), but also can be locally connected via a local area 

i ft 

T' network (LAN). In the conventional library client/server system the library client 42 is typically 
F? embodied in a workstation, such as a personal computer, and the library server 44 and object 
ill servers 48 are typically embodied in a host processor: generally a mainframe computer 
S environment such as a MVS/ESA environment running under CICS. The library server 44 uses a 
%) relational database such as the IBM DB2 Universal Database or the Oracle database as a library 

catalog 46 to manage digital objects and provide data integrity by maintaining index information 

and controlling access to objects stored on one or more object servers. Object servers can also 

use a relational database such as IBM DB2 or the Oracle database to manage their contents. 

Library servers and object servers run, for example, on ADC and Windows NT. 
25 Library Server. The library server 44 directs requests from clients to update or query 

entries in the library catalog 46, which contains object indexes and descriptive information. 

Library server 44 additionally performs searches and routes requests to the appropriate object 

server 48 to store, retrieve, and update objects. 

Each user is assigned a set of privileges for access to the library by a system 
30 administrator. Library server 44 checks library catalog 46 before processing a request to ensure 
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that the user's name and password are valid, and to ensure that the user has been granted the 
appropriate privileges to perform the requested action. An example of a library privilege is the 
ability to delete objects. In typical implementations, there are groups of individuals who need 
access to the same objects. Therefore, to simplify the process of granting access to objects a 
5 system administrator can define patrons as members of a group. When a patron is defined as a 
member of a group, that patron is able to access any object for which the group has been granted 
privileges. 

The library server 44 also checks to ensure that the object's owner has granted the patron 
the privileges needed to do what is requested (e.g., update the object). The owner of an object is 
10 the user who first stored the object. When an owner stores an object that owner must specify 
which other patrons are to have access to the object. 
# If a client request involves the storage, retrieval, or update of an object, library server 44 

EQ forwards the request to the object server 48 that contains or will store the object(s) referred to in 
S the request based upon information provided by library catalog 46. If the client request is a query 
Ol5 of the information stored in library catalog 46, library server 44 will interact only with the library 
I catalog 46 and will not contact object server 20. 

|;f Library Catalog. The library catalog 46 is analogous to a conventional library's card 

W catalog. It is a set of database virtual tables or index classes which contain an index of all the 
O objects stored in the library system and the object servers owning them. Each row of these 
U 20 virtual tables or index classes references one or more stored objects. Implicitly, the first column 
of each index class contains a unique digital library item identifier (e.g., the IBM DB2 Digital 
Library ItemID) for the object referenced by its corresponding row. Other information stored in 
an index class may include textual descriptions for each object, information on the type of object 
(e.g., image object, spreadsheet, text document), user names and privileges, access authorization 
25 data for each object, links between objects, and an object's properties. 

An item is a row in an index class and a part is a file within the object server 48 that is 
stored in an access managed directory structure. The management access of the directory 
structure is performed by the object server 48, but the directory structure responsibilities are 
performed by the operating system (i.e. AIX, NT, MVS). 
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The library server 44 contains a parts table 62, as shown in FIG. 4, which resides in the 
library catalog 46. For each part or object in the library system, library server 44 stores 
information about that part. As shown in the parts table 62 in FIG. 4, the information stored for a 
part includes the item identifier (ItemID), a part number (PartID), a representation type (REP 
5 type) and an object server ID identifying which object server contains the object. In the presently 
described embodiment of the invention, the REP type is a default value (FRN$NULL). 

When a part is stored in the conventional client/server library system 20, library server 44 
assigns an item ID and a part number, which are 16 bytes and 4 bytes long, respectively. The item 
ID is a unique identifier for an item (i.e. row in the library server index class) to which the part 
10 belongs. For example, an item could represent a folder in which the part represents a document 
within that folder. Likewise, the part number is a unique identifier for that part, 
ifl The REP type field can be used to indicate the type or class in which the part is classified. 

"S s *-B 

ffl For example, if the part is an image stored in a TIFF format, the REP type for that part could 
M indicate that the part is a TIFF formatted image. 

Ol5 Object Servers. An object server 48 maintains objects stored within the library system. 

,1 Objects are stored or retrieved from an object store 50 by object server 48. Object server 48 
! = f receives requests from library server 44 and communicates with library client 42 to complete the 
W requests. Such a library system can contain several distributed object servers. Referring to Figs. 3 
b and 4, the object server field in the library server's parts table 62 indicates the identifier for the 
W 20 object server 48 which owns the part. For example, if the part is stored on object store 50 of 
object server 48, the object server ID field will contain the identifier for object server 48. 

Each object server 48 contains an object server table 64 as shown in Fig. 4. The object 
server 48 uses object server table 64 to manage storage of parts in its storage areas, such as the 
object store 50. Object server table 64 also contains the same item ID, part number and REP type 
25 for the part as does the library server parts table 62. The object server table also contains a file 
name for the part 66, which indicates the location in object store 50 of stored part 66. 

When a user's privileges are defined a default object server can be set for that user. When 
the user stores an object, it will be stored in his default object server. If it is later determined that 
an object or a group of objects should be relocated to a different object server, a client 
30 application can cause those objects to be moved from one object server to another. 
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Library Client. The library client 42 is the interface through which application 
programs can submit requests to the library system. These can include requests to store objects, 
update/add descriptors to objects, delete objects and query information in the library catalog. 
Library requests can be submitted through the library client either individually or in batches. 
5 The library client 42 includes a client cache 40 used to locally hold copies of objects that 

have been stored to or retrieved from the object server 48. These local copies allow very fast 
access to objects and provide a means for communicating between the library client 42 and the 
servers 44, 48. 

Additional Search Support. IBM DB2 Digital Library includes parametric search 
10 support, and is integrated with text search support from the IBM Intelligent Miner for Text. The 
library server 44 may be further integrated with other search support 52. For example, image 
; fl querying may be provided by IBM's Query by Image Content(QBIC) technology (see 
5 commonly owned U.S. Patent No. 5,579,47 1 to Barber et al.). 

J{j In the present example for creating compilations of text, library server 44 is preferably 

Ol5 coupled to the IBM Intelligent Miner for Text full text search support, allowing the user to 
IT automatically index, search, and retrieve documents based on a full text search. Text Miner 
H allows users to locate documents by searching for words or phrases, abbreviations and acronyms, 
RJ and proper names. In a typical LAN environment, a text search installation comprises one or 
S more servers and several clients. The text search server program is installed on a machine with 
* :3 20 other Digital Library components. The text search client resides on client workstations and 

provides access to the server. Text search runs, for example, on ATX and Windows 95 and NT. 
In addition to the server and client components, text search uses dictionaries to support the 
linguistic processing of documents in different languages during indexing and retrieval. 
Dictionaries are installed on the server workstation, and at each client workstation. 
25 Data Flow. Referring to Figs. 3 and 4, when a requesting library client 42 requests an 

object, or blob, it sends a request to library server 44. Upon receipt of the request library server 
44 consults the parts table 62, among other tables, in the library catalog 46 and determines which 
object server 48 owns and has the requested object stored in its object store 50. The request 
contains the item ID, part number and REP type of the requested part. Upon receiving the 
30 request, object server 48 retrieves the blob from object store 50 by consulting its object server 
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table 64 and sends a copy of it to client 42. Object server 48 stores the blob in client cache 40. 
When the blob is successfully transmitted to client cache 40 object server 48 sends a response to 
library server 44 indicating a successful transfer of the blob to client cache 40. Library server 44, 
in turn, sends a response to requesting library client 42 indicating that the blob was successfully 
transferred, which allows the client 42 to retrieve the blob from client cache 40 for use by a client 
application. 

When an application program submits a request for storage of an object in the library 
system, library client 42 creates a copy of the object in its client cache 40 to allow the appropriate 
object server 48 to retrieve the object. The library client then sends a storage request to library 
server 44. Included in the storage request is a handle to the object stored in the client cache 40. 
The handle is an identifier which is used to locate the object in the client cache. 

Upon receiving the storage request, library server 44 updates tables in library catalog 46, 
including the parts table 62 shown in Fig. 4, to identify the object server 48 in which the object is 
to be stored. Typically, the object server 48 is selected by default based on the user's identity. 
Library server 44 then sends a request to object server 48 to retrieve the blob from the client 
cache 40 and store it in the object store 50. Included in the request is the handle of the object 
stored in client cache 40 and the item ID, part number and REP type of the part. 

The object server 48, upon receiving the request to retrieve a copy of the object, retrieves 
the copy from client cache 40 and stores it in object store 50, then updates its object server table 
64 accordingly to indicate a file name for the blob stored in object store 50. The file name 
uniquely identifies the location of the blob stored in object store 50. 

Upon successfully storing a copy of the blob, object server 48 sends a response to library 
server 44 to notify it that the object was successfully stored. Library server 44 then updates its 
tables including the parts table 62 to indicate that the object is successfully stored in object server 
48. The library server 44 sends a response to library client 42 indicating that the object was 
successfully stored so that the library client 42 can take further action based on the fact that the 
object was successfully stored in object store 50, such as deallocating memory resources for that 
object in client cache 32. 
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2. Data Model Definition 

Storing content for use in creating a compilation of content first requires defining a Data 
Model, i.e., the constructs for mapping input content and other information in digital library 20. 
The data model is dependent on the constructs available within the underlying data repository. It 
5 is also defined by the nature of the content and information being input. 

The content to be stored comprises products such as books, albums, images and videos. 
The content of each of these products may be organized hierarchically. For example, the 
hierarchy of a book may be defined by its volumes, chapters and chapter subsections. Since it is 
desired to create compilations of content from selected entities of these products, the content of 
10 the input products is partitioned into selectable entities. Information about the hierarchical 
m relationship is also stored in the data repository. In the present example, other information to be 
; 0 stored includes user and content category definitions. 

5 In the present example, the data repository is a digital library that includes a relational 

JS database, and the data model consists of entity groups defining the constructs in which the 
Ml 5 content is to be organized and stored within the relational database. Each entity group includes 
b index class definitions, and may include part definitions. The parts store the actual content, and 
5^ outlines describing the hierarchical relationship of the content entities. The index classes define 
fU relational tables for storing parametric attributes parametric (i.e. Integer, Float, Date, Time, 
Q String, Char, etc.) of the content, programs, and approval requests. The content index classes 
fe 20 further include references to the parts containing them. 

There are four entity groups in the present example: the Product Entity Group, the 
Program Entity Group, the CBO Entity Group and the Request Entity Group. The Product Entity 
Group defines the constructs for storing prepublished works or "products" in the digital library 
20. These products provide the content from which a user can build a compilation of content. 
25 The Program Entity Group defines categories for content. In the present example these 

categories consist of academic programs. For example, "Freshman Engineering" is one program 
defined in the present example. The CBO Entity Group defines the constructs for storing a 
compilation of content. The Request Entity Group defines the contracts for storing information 
about requests for approval of compilations of content. 
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The following tables represent index class definitions, i.e., the meta definitions of the 
index classes. The rows within the figures define the columns of the index classes. For example, 
the Product_Aux index class contains 8 columns: SeqID, Productltem, Parentltem, Siblingltem, 
Childltem, Keyword, Value andNextValueltem. 
5 Each primary index class contains a fixed number of columns. The columns of the index 

class definitions for the primary index classes define the primary index class column name (first 
column from the left), attribute type (second column), and source of the attribute value for each 
column of the index class (third column). In some cases, an attribute value is passed to digital 
library 20 by the loader 14 application, and the second column of the definition table is used to 
10 map the external attribute names to the internal digital library attribute names. In other cases, the 
^ attributes are program generated, as is indicated by the value "program generated" in column 
; 0 two. In the index class definition tables below, a fourth column has been added to each table to 
fj describe each column. It shall be understood, however, that this column is only provided to 
IS facilitate the reader's understanding and is not a part of the index class definitions. 
Gj.5 The primary index class columns are restricted to single value attributes. Those columns 

J that are multivalued or were not known when the system was first created are placed into the 
y" auxiliary index class. 

W The Program Index Class, Product Index Class and Request Index Class each have an 

O associated auxiliary index class (ProgramAux Index Class, ProductAux Index Class, and 
U 20 RequestAux Index Class). Use of auxiliary index classes is generally understood by those skilled 
in the use of digital libraries. Each row within an auxiliary index class defines an additional 
(theoretical) column to a ROW in the corresponding primary index class (NOT to the entire 
primary index class). The column is theoretical in the sense that the digital library 20 does not 
handle auxiliary index class rows as additional columns in the primary index class. Rather, the 
25 API layer 30 provides the mapping mechanism to enable this theoretical column notion. 

Therefore users perceive these auxiliary index class rows as additional columns for a row, but in 
actuality they are stored as rows within the auxiliary index class. Theoretically, the primary 
index class appears as a table containing multiple rows and each row contains the columns 
defined in the primary index class definition plus those columns defined by rows in the auxiliary 
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index class. In other words, these auxiliary index class columns (a.La. theoretical columns) are 
bound to a row within the primary index class and not the primary index class itself. 

The manner in which an auxiliary index class defines theoretical columns on rows of a 
primary index class will now be described with reference to the Product Entity auxiliary index 
5 class. The Productltem column (represented as a row in the auxiliary index class definition, 
below) contains the itemid, a unique identifier for each row in the primary index class. This 
column forms the linkage between a row within the auxiliary index class and the corresponding 
row of the primary index class. 

The keyword column of the auxiliary index class (not to be confused with the Keyword 
10 column of the auxiliary index class definition) represents the name of the theoretical column to 
be added to a row of the primary index class. The current domain of theoretical primary index 
A class column names appears in the Keywords column of the product auxiliary index class 
5 definition, below (2nd column from left). For example, one theoretical column name is 
:| Pub_Med_Type. 

I3l5 Note: In the present example, the domain is not restricted by the digital library 20 other 

T than that the names must not exceed the length of the keyword column definition. Therefore, the 
P domain of theoretical primary index class column names can be continuously enlarged by simply 
111 adding additional columns to the auxiliary index class. 

K The Value column contains the value for the theoretical column identified by the 

H20 auxiliary index class Keyword column. 

In addition to defining additional theoretical primary index class columns, the auxiliary 
index class can store multiple valued theoretical columns and hierarchical theoretical columns. 
Similar to theoretical single valued columns, theoretical multiple valued columns can be 
represented within a relational datastore model by using rows of an auxiliary index class. In the 
25 single valued column, only one row is necessary. In the multple valued column, two or more 
rows are necessary (1 row for each value needing to be stored). Each value in the multiple 
valued column is ordered. This order is then used to chain multiple rows within the auxiliary 
index class together. Furthermore, the NextValueltem column contains the unique identifier of 
the auxiliary index class row which follows in the multivalued chain. 
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For example, if one wishes to store a multivalued column, First_Name with values: Fred 
and Barney and the auxiliary index class row containing Barney in the Value column has a 
unique identifier equal to ABC then the NextValueltem column for the row containing Fred in 
the Value column is ABC. Thus, the NextValueltem serves as the pointer to the next value in the 

5 multivalued chain. 

The Parentltem, Siblingltem and Childltem columns in the auxiliary index class are used 
to store hierarchical attributes of a row. Since a book's data model is hierarchical, the concept of 
hierarchical attribute storage/retrieval is crucial. The Parentltem column of a row in the auxiliary 
index class contains the unique identifier or itemid of another row in the auxiliary index class 
10 that holds a parent attribute of the current row. Similar to the multivalued columns, the children 
^ of a container are ordered (chained together). The unique identifier of the auxiliary index class 
! fi row containing the proceding child is stored in the Siblingltem field. A container' s first child' s 
5 unique identifier is stored in the Childltem column of the container row, thereby constructing a 
Si link between the container and first child, first child and second child and all other children after. 
Ql 5 For example, the AC_Group column in the product auxiliary index class is a hierarchical 

r attribute. AC_Group contains child attributes: ACFORMID and NUMBERAC. This 
!*f inheritance is identifiable by the tabbing of the terms in the keywords column of the figure. Each 
|U AC_Group attribute contains an ACFORMID and NUMBERAC. Therefore the AC_Group is a 
p kind of container. 

W 20 This attribute family is represented by three rows within the auxiliary index class: one 

representing an AC_Group, one representing the ACFORMID and one representing the 
NUMBERAC. The parentltem column for the AC_Group row is blank to indicate that it is a 
parent attribute, whereas the parentltem column for the ACFORMID and NUMBERAC rows 
contains the unique identifier of the AC_Group row. The Childltem column of the AC_Group 
25 contains the unique identifier of the ACFORMID row. The Siblingltem column for the 

ACFORMID contains the unique identifier of the NUMBERAC row. The NUMBERAC row's 
Siblingltem is left blank representing the last child of AC_Group. 

The Product and CBO Entity Groups are associated with Part definitions, since these 
entities define constructs for storing content in the digital library 20. 
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Product Entity Group 

The Product Entity Group includes two index classes: Product Index Class and 
ProductAux (Auxiliary) Index Class. These index classes define the storage model for existing 
content products and their associated attributes to be stored. More specifically, they are used to 
5 generate a Product Index class in a relational database representing the content products, and the 
parts containing the actual content, prerequisite material and hierarchical product outline. 

"Product" in this context refers to an existing content product such as a book, album or 
video. Since users will be selecting excerpts of existing content products to include in a 
compilation of content, each content product is stored as a group of hierarchically related entities. 
10 Entities at each hierarchical level of the work except the lowest is defined by containers. In the 
« present example, the containers are "book", "volume", and "chapter". Each container is 
'fl described by the subentities or "content entities" it contains. For example, each "bookx" 
m container includes references to all chapters denoted by the keyword, "chapter.c", contained in 
;| that textbook product. Similarly, each "chapter.c" container includes references to all sections 
O15 contained in that chapter. The lowest level of the hierarchy is a "section". All three entities 
(book.c, chapter.c and section) are described by a unique sequence identifier. Each entity is 
jhf represented by a row in the Product Index class 
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Product Index Class 

The product index class defines a relational Product Index Class that is populated with a 
row for each content entity. Thus for textbook products the resulting product index class 
includes a row for each book, volume, chapter and section. In addition, each associated 
5 component for an entity is also represented by a row in the index class This index class is used as 
a quick reference for obtaining attribute information about each product entity, as well as a 
reference to the actual part numbers containing the product files. 

Each product entity is assigned a unique identifier or sequence ID. Preferably, the 
sequence identifier further includes intrinsic information about the hierarchical level of the entity. 
10 To illustrate, the sequence ID used to represent textbook components is in the following form: 

I xxxxxxxxxx.cc.ss 

J where XXXXXXXXXX represents a book' s ISBN (International Standard Book Number?), CC 

Cl 5 represents the chapter number (if any) and SS represents the section number (if any). The CC 

* *' and SS portions of a book entity sequence identifier will be zeroes. Similarly, the SS portion of a 

P chapter entity sequence identifier will be zero. Thus the sequence number of a container serves 

f U as a reference to the subentities of that container, since all subentities will share the same ISBN 

K and container reference number. For leaf entities, the sequence number is used as a reference to 

!|s %0 the entity' s actual content in the data repository. 
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Index Class 

Attribute 

Name 


ATR Mapping 


Type 


Description 


SeqID 


SeqJD 


Ext..Alpha [32] 
INDEXED 


Unique sequence identifier tor product entity 


jtiniity l ype 


PSF 


Ext ..Abha [321 


Entitv type, e.g., book, chapter, section 


Parentltem 


Program generated 


Ext..Alpha [16] 


Unique internal ID of any parent entity (e.g. For 
a section entity, the parent would be its chapter 
container) 


biDlingitem 


rrugrdiii gCIlCIalCU 


Fxt Alnha T161 


Unique internal ID of the next sibling entity 
(e.g. For a section entity, the siblings would be 
other sections of the same chapter. 


Childltem 


Program generated 


Ext..Alpha[16] 


Unique internal ID of the first child entity (e.g. 
a chapter entity's children would be the sections 
it contains. 


Auxltem 


r^rogrdm geneiaieu 


Fxt Alnha T161 


Reference to first entry in the auxiliary table for 
this entity 


rrogramijj 


AC PE ID 


Alnha r 41 


Identifier of Program to which the product 
belongs 


Matus 


oiaius ^okjivii^^ 
AC Status (AC) 


Alnha Til 

/Yl Jyilci LAj 


Indicates if entity is available for browse, 
search or use in a CBO 


iitie 


i lue 

AC Title 


Alnha T2501 


Entity title 


oUDtitie 


OUDLILIC 

AC Subtitle 


Alnha T2501 


Entitv subtitle, if any 




TCT3M 
lO-DrN 

AC ISBN 


Alnha HOI 


Product ISBN 




V^L/r\vJlL' 

AC CDAOID 


Ext Alnha T81 


Associated component attribute 


YearOfPub 


Yr_of_Pub 
AC Yr of Pub 


Numeric [1] 




Edition 


Edition 
AC Edition 


Ext. Alpha [2] 


* 


Kevision 


AC Revision 


Alnha [21 




Version 


Content_Ver 
AC Content Ver 


Ext. Alpha [8] 




PubMediaType 




Ext. Alpha [20] 


Media type, e.g., compact disk 


content i ype 


f~ 1 r\ntF>«t Tvnp 

v-uiilciii_ i ypc 
AC_Content__Type 


Ext Aloha T81 


Content type, e.g., SGML 


ContentFilename 


Filename 

AC rrrnnhic Filena 
me 


Ext. Alpha [254] 


Name of file containing the entity's content 


ImageType 


AC Image Type 


Ext. Alpha [8] 


Type of image, e.g., TIF. 


CharCount 


SGML_Char_Cnt 


Numeric [8] 


Number of non-markup characters in content 
fused to calculate CBO price) 


ACJmageCount 


AC_Image_Cnt 


Numeric [3] 


Number of associated component images in 
content 


AvailabilityDate 


Date_of_Availability 
ACJDate_of„Availa 
bility 


Ext. Alpha [10] 


Date entity is available for use 


ExpirationDate 


Date_of_Expiration 


Ext. Alpha [10] 


Date entity is no longer available for use 
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AC Date of Exnirat 
ion 






CreateDate 


CreateJDate 
AC Create Date 


Ext. Alpha [14] 


Date that table entry was created 


CreatedBy 


Created_By 
AC CreatedBv 


Alpha [8] 


Identifier of user who created entry 


LastModifiedDate 


Last_Modified_Date 
AC_Last_Modified_ 
Date 


Ext. Alpha [14] 


Last date entry was modified 


LastModifiedBy 


Last_Modified_By 
AC LastModifiedBy 


Alpha [8] 


Identifier of user who last modified entry 


PageCount 


PageCount 


Alpha [6] 


Actual page count of content (used in CBO 
pricing formula) 



Part Structures & Text Indices 

This table defines the digital library parts used to store each entity. For a row that represents 
a product entity, Part 1 contains the SGML content for a product entity. Parts 5-1 1 are parts 
containing subsets of that content that can be searched by Text Miner. The Text Index column 
contains the Text Miner indices for each of these searchable subsets. For a row that represents an 
entity's associated component, Part 20 contains the actual associated component file, (e.g., images). 



Part No. 


Description 


Text Index 


l 


Content 


None 


5 


Authored Abstract 


EABSTRAC 


6 


Generated Abstract 


EABSTRAC 


7 


Index Terms 


EIXTERMS 


8 


Key Terms 


EIXTERMS 


9 


Entity Structure Part 


None 


10 


Teaching Concepts 


ETEACHCO 


11 


Concepts Topics 


ETOPICS 


20 


Associated Component 


None 
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ProductAux Index Class 

In the present example, the auxiliary index class is used to define additional columns in 
specified rows of the Product Index class Specifically, each label in the Keywords column 
corresponding to the Keyword attribute defines the name of an additional column in the Product 

5 Index class The "value" attribute is the attribute type for each of these keywords. Indentations 
represent nested keywords. The SeqID, Productltem, Parentltem, Siblingltem andChildltem 
attributes specify the rows in the auxiliary Product Index class for storing hierarchical values. In 
the present example, "Index_Term" is an example of a multi-valued attribute, meaning that there 
may be more than one index term defined for each program entity. For performance reasons, the 

10 values of a multivalued attribute may be stored in separate rows of the Product Index class Thus 
The "Next Valueltem" attribute identifies the row of the next item in a set of attribute values. 

1 Multivalued attributes are structured as linked lists when loaded into digital library 20, and this 

% order is maintained in the auxiliary Product Index class 
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Index Class Keywords Type Description 



Attribute 
Name 



SeqID 


PSF 


Ext. Alpha [32] 


Sequence ID of entity that this 
attribute belongs to 


Productltem 


Program generated 


Ext. Alpha [16] 


Unique internal ID of the product 
index class row that this attribute 
belongs to 


Parentltem 


Program generated 


Ext. Alpha [16] 


Unique internal ID of the auxiliary 
product index class row that is this 
attribute's parent attribute 


Siblingltem 


Program generated 


Ext. Alpha [16] 


Unique internal ID of the auxiliary 
product index class row that is this 
attribute's next sibling attribute 


Childltem 


Program generated 


Ext. Alpha [16] 


Unique internal ID of the auxiliary 
product index class row that is the 
first child attribute for this attribute 


Keyword 


Pub_Med_Type 
AC_Counts 

ACFORMID 

NUMBERAC 

Index_Term 

Key_Term 

Contrib_Group 

Contribjtole 
Contrib_Title 
Contrib_First_Name 

l^OnLTlD_lVllUUlC_lN OUIC 

ContribJLastJNTame 

UIlll 1 U iJ UlliA 

Job_Title 

Contrib_Affiliation 
Contrib_Credentials 

Use_Actuals 


Alpha [32] 


Media type (e.g., compact disk, audio 
tape, paper, etc.) 

AC (Associated Component) 
attribute group 

AC type. 

The number for each AC type. 

Index term in a product entity 

Key term in a product entity 

This group defines properties re: one 
contributing author of a prepublished 
book. Since a book can have multiple 
contributors, more than one 
contrib_group of properties can exist 
for that book. 

Switch variable to determine if actual 
or estimated page count is to be used 
in calculating price. 


Value 


Value depends on specific 
attribute keyword above 


Ext. Alpha 
[254] 


Actual value for the keyword above 


NextValueltem 


Program generated 


Ext. Alpha [16] 


Unique ID of the auxiliary product 
index class row that is the next value 
in a multi-valued attribute. 
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Program Entity Group 

It is sometimes desirable to categorize users and content to facilitate the creation of a 
compilation. For example, a system user who wishes to compile an album of classical music is 
not interested in viewing selections from a country music album. Audio content may therefore 
5 be categorized according to music type. The user may also be assigned to a particular category, 
either by default or by personal selection. In a system for creating custom textbooks, subsets are 
organized according to particular programs or disciplines. For example, prepublished textbooks 
may be assigned to categories such as Engineering, Mathematics, English, and so on. In the 
present example, these categories have been defined even more narrowly: Freshman 
10 Engineering, Sophomore Engineering, etc. 

A Program Entity Group is used to define categories or "programs" to which users and 

*C prepublished content can be assigned. 
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Program Index Class 

The Program Index Class definition below defines a Program Index Class or Program 
Index class that is populated with a row for each user/content category. This index class is used 
as a quick reference for obtaining attribute information about each program. 



5 





Index Class 


ATR Mapping 


Type 


Description 1 




Attribute Name 










ProgramJD 


PE_Program_ID 


Alpha [4] 
INDEXED 


Program identifier, e.g. "EE" is the 
identifier for "Freshman Engineering" 




EntityType 


PSF 


Ext. Alpha 
[32] 


Used when programs are nested to define 
hierarchical level of each program entity 




Parentltem 


Program generated 


ExL.Alpha 
[161 


Supporting hierarchical or "parent" 
programs 




Title 


PE Title 


Alpha [250] 


Program Title 




Subtitle 


PE Subtitle 


Alpha [250] 


Program subtitle, if any 




AvgChrPerlmage 


PE_AC_Avg_Image_Bytes 


Numeric [6] 


Average characters per image for products 
within this program 




AvgChrPerSGMLAC 


PE„AC_Avg_SGML_Bytes 


Numeric [6] 


Average characters per SGML associated 
component for products within this 
program 


:: 


MaxChrPerUPMTier 


PE_Chars_Per_UPM_Tier 


Numeric [6] 


Maximum number of characters allowed 
for a UPM in this program 


:: t 


Status 


PE_Status 


Alpha [1] 


Matus indicating wneuier program oniuy 
is currently valid/invalid 




CreateDate 


PE_CreateDate 


Ext.. Alpha 
[14] 


Date table entry created 


IU 


CreateBy 


PE CreateBy 


Alpha [8] 


Identifier of user who created entry 




LastModifiedDate 


PE_LastModifiedDate 


Ext.Alpha 
[14] 


Date entry was last modified 




LastModifiedBy 


PE LastModifiedBy 


Alpha [8] 


Identifier of user who last modified entry 




Siblingltem 


Program generated 


Ext.. Alpha 
[16] 


Related sibling programs providing 
support for hierarchical programs. 




Childltem 


Program generated 


Ext.Alpha 
[16] 


Related child programs, if any, providing 
support for hierarchical programs. 




Auxltem 


Program generated 


Ext.Alpha 
[16] 


Reference to auxiliary table 




SeqID 


PSF 


Ext. Alpha 
T32] 


Unique program identifier, e.g., "FE" for 
"Freshman Engineering" 
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ProgramAux Index Class 



Index Class 


Keywords 


Type 


Description 


Attribute 








Name 








SeqID 


PSF 


Ext. Alpha 


Unique identifier (i.e., Sequence ID) 




[32] 


of this row. 


Programltem 


Program generated 


Ext.Alpha 


Unique internal id oi row witnin 


[16] 


auxiliary program index class that 








this attribute belongs to 


Parentltem 


Program generated 


Ext.Alpha 


Unique internal ID or row within 






auxiliary program index ciass huh un& 








attribute's parent attribute belongs to 


Siblingltem 


Program generated 


Ext.Alpha 


Unique internal ID of row within 


[16] 


auxiliary program index that this 








attribute's next sibling attribute 








belongs to 


Childltem 


Program generated 


Ext..Alpna 


Unique internal ID of row within 




[16] 


miYilifirv nrooram index that the first 








child attribute for this attribute 








belongs to 


Kevword 


PE_Req_Count 


Alpha [32] 


Thf* npYt available uninue identifier 






for a request 




PE_AC_Group 




This ormin defines associated 






component attributes used in the 








nricins formula 




PE„AC_FormID 




AC type 




PE AC ByteCount 




Number of "characters" for that AC 








type 




PE_Price_Group 




This ffrouD defines more attributes 






used in pricing formula 




PE_Country 








PE_Monetary„Unit 




Monetary unit 




PE„Min_Order„Price 




Minimum order price 




PE„Base„Cust„Pub„Price 




oase price auueu 10 evciy niaium 








publication 




PE„Base_UPM_Fee 




Base price added when UPM is 








included 




PEJncrJJPM„Fee 




Additional price per UPM pricing 








block 




PE_Source_Price_Per_Page 




Price per page for prepublished 








content included 
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PE_UPM_Bytes_Per_Page 
PE Volume Page Limit 




Number of UPM characters in a page 

Minimum number of pages required 
in a custom publication 

Maximum number of pages in a 
volume 


Value 


Value depends on specific attribute 
keyword above 


Ext. Alph 
T254] 




NextValueltem 


Program Generated 


Ext.Alpha 
[16] 


Unique internal ID of row within 
auxiliary program index representing 
the next value of a multi-valued 
attribute. 



CustomBookOutline Index Class 

n 5 

5 The CustomBookOutline Index Class defines a relational CBO Index Class that includes 

ft a row for each compilation of content created. Each row further includes a reference to a part 

containing a road map or outline of the compilation of content. The index class is used as a 
O quick reference for obtaining attribute information about a compilation, as well as for locating 
M \0 the corresponding part numbers. Again, the attributes are a matter of design choice. 



Index Class 
Attribute Name 


Source 


Type 


Description 


ProgramID 


Web application 


Ext. Alpha [4] 


Program identifier 


CBOTitle 


Web application 


Alpha [120] 


Custom book title 


ApprovalStatus 


Program generated 


Alpha [1] 


Approval status, i.e., active, submitted, approved, 
rejected or printed 


UPMCharCount 


Program generated 


Alpha [8] 


Character count of any user- provided content 


RightsFee 


Program generated 


Alpha [8] 


License fee 


SGMLPageEstimate 


Program generated 


Alpha [4] 


Estimated page count for SGML content 


TotalPageEstimate 


Program generated 


Alpha [4] 


Estimaged total page count 


PriceEstimate 


Program generated 


Alpha [8] 


Estimated price 


ISBN 


Program generated 


Alpha [10] 


Unique ISBN assigned to the custom book at 
submission time. 


CreatorlD 


Program generated 


Alpha [20] 


Creator's unique identifier 


CreatorTS 


Program generated 


Alpha [14] 


Timestamp representing time of current edit 


LastModifiedTS 


Program generated 


Alpha [14] 


Timestamp representing time last modified 


CBOTerms 


Program generated 


Ext. Alpha [32] 


Name of file containing terms and conditions that 
will apply to custom book? 


ActiveCBOPartID 


Program generated 


Alpha [3] 


Part number of active custom book 


LastUPMPartID 


Program generated 


Alpha [3] 


Part number of the last user-provided material 
added 
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Part Structures & Text Indices 

The part definition describes the parts associated with each compilation. In the present 
example, three parts are defined: part 1 initially containing the custom book outline, part 2 initially 
containing a backup copy of the custom book outline for use in undo operations, and parts numbered 

5 50 or higher containing user provided material (UPM). (Note: After undo, part 2 becomes the active 
CBO, and part 1 is the backup. The attribute value of "ActiveCBOPartED" indicates which is of 
these is currently the active part.) The first UPM added to a custom book is assigned to part 50, the 
second UPM added is assigned part 5 1 , and so on. The last UPM part number assigned is stored in 
the CBO Index class defined above and serves two functions. It is a value that is retrieved and 

10 incremented each time new UPM is added. In addition, it serves as an indicator of how many parts 

!• the custom book currently contains. 



Part No. 


Description 


Text Index 


1 


Part number for Active / 


None 




Inactive CBO 




2 


Part number for Active / 


None 




Inactive CBO 




50+ 


Part numbers for 


None 




user-provided content 





5=i 1 5 Request Entity Group 

Whenever a compilation of content is submitted for publication, the Request Entity 
Group is used to generate an entry in a Request index class corresponding to the submission 
request. A unique ISBN is assigned to the CBO once it has been approved for publishing. 
20 Attributes are a matter of design choice. In the present example, they describe useful information 
about the custom book such as its unique identifier, author, approval status, price, etc. 
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Request Index Class 



Index Class Source Type Description 
Attribute Name 



CBOID 


Program generated 


Ext. Alpha [20] 


Unique Ct>U laentiiier assigned at suumission 
time 


Userid 


Program generated 


Ext. Alpha [20] 


Autnor 


ApprovalStatus 


Program generated 


Alpha [1] 


Cr>u state m tne process 

0 - Active 

1 - oUOmilLCU 
0 _ AnnrnvpH 

3 - Reiected 

4 - Printed 


i otairrice 


JrrOgialll gt/IlCIalCU 


X>r lilllt/i iw l^j 


Price of custom book 


i^ty o tuaenu^ opies 


W CU dppilOallUIl 


Nnmprif* T41 


Number of student copies requested 


V,JtyL>esKCopies 


W CD dppilCaLlUll 


Miimprir* T^l 


Number of desk conies reauested 


(Jtyouppiements 


w eo appncdiiun 


l^Uli.lt'llL' 


Number of books to be used as supplements 


JNeearsyuate 


W eu appilCaLlUIi 


Fxt Alnha HOI 


Date needed bv 


TermStartDate 


wen application 


Pvt Alnha riOl 


^tart date of the school term for which this 
CBO is created 


i ermiN ame 


W CD appilCallUIl 


Fxt Alnha F201 


E 2 Snrins. Fall 


universiiy 


W/^K or\r>1 if^citi rvn 
Wcu appilValltJH 


Fxt Alnha T1001 


University name, e.g., Stanford University 


Department 


weD appiicaiion 




Denartment name, e.g.. Electrical Engineering 


ciassrs ame 


\X/aVi 5inn1ir*ati r\r\ 
VY CL* appiicaiiuii 


Fxt Alnha T1281 


Class name, e.g., Engineering Basics 


Ljassrsumoer 


W/c»K Q-n-nlifatinti 

VV CU <l|jpilL<u.U.*JIl 


Fxt Alnha T121 


Class number 


v^ourseiN uiiiuci 


W/ph annlipatinn 


Ext Alnha ri21 


Course number, e.g., 101 


ShipToNameTitle 


Web application 


Ext. Alpha [12] 




ShipToFirstName 


Web application 


Ext. Alpha [40] 




Ship 1 oEastJName 


weo appncauon 






Snip 1 o AddrLine l 


weo application 


Fxt Alnha T401 


* 


ShinToAddrLine2 


Web application 


Ext. Alpha [40] 


* 


ShipToAddrLine3 


Web application 


Ext. Alpha [40] 




ShipToCity 


Web application 


Ext. Alpha [40] 


* 


ShipToState 


Web application 


Ext. Alpha [20] 




ShipToCountry 


Web application 


Ext. Alpha [20] 




ShipToPostalCode 


Web application 


Ext. Alpha [20] 




PackagelSBN 


Program generated 


Alpha [10] 


The ISBN assigned to the entire book. This 
may be different from the ISBN's assigned to 
volumes within the book. 


CreateTS 


Program generated 


Alpha [14] 


Time entry created 


RequestID 


Program generated 


Ext. Alpha [16] 


Unique request identifier 



* Self-explanatory 
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RequestAux Index Class 

The RequestAux Index Class is used in the present example to add additional columns to 
designated rows of the Request Index class when a CBO contains more than one volume. More 
5 specifically, if greater than one volume exists, the CBO and each volume it contains are each 
assigned a unique ISBN, and the Volume, VolumelSBN and VolumelD columns are added to the 
row representing the submission request. The Requestltem, Parentltem, Siblingltem and 
Childltem attributes are used to identify the row to which these columns are added. 





Index Class 

Attribute 

Name 


Source 


Type 


Description 




Requestltem 


Program generated 


Ext Alpha [16] 


Unique internal ID of row within request index 
class of entity that this attribute belongs to 




Parentltem 


Program generated 


Ext. Alpha [16] 


Unique internal ID of row within auxiliary request 
index class of entity that is this attribute's parent 


li.J li 


Siblingltem 


Program generated 


Ext Alpha [16] 


Unique internal ID of row within auxiliary request 
index class of entity that is this attribute's next 
sibling (siblings are ordered) 




Childltem 


Program generated 


Ext Alpha [16] 


Unique internal ID of row within auxiliary request 
index class of entity that is this attribute's first 
child (children are ordered). 


g-sr . 
ii «? 


Keyword 


Volume 

VolumelSBN 
VolumelD 


Alpha [32] 


The parent attribute of the volume information. 

The child attribute of Volume which stores the 
ISBN of the volume. 

unique internal ID of row within request index 
class of volume entity corresponding to this 
volume. 




Value 


Program generated 


Ext Alpha [254] 




10 


NextValueltem 


Program generated 


Ext Alpha [16] 


Unique internal ID of row within auxiliary request 
index representing the next value of a multi 
valued attribute. 
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Login/Registration Database Model 

The Users Table simply defines a relational table for storing user information. The fourth 
column represents if this is a primary key field and the fifth column represents if this is a foreign 
5 key field. 



USERS Table 





Table Column 


Table Column 


Table 


Table 


Table 




Name 


Datatype 


Column Null 
Option 


Column 
Is PK 


Column 
Is FK 






VAPrHAR9HO^ 


NOT NULL 


Yes 


No 




JJllrl UJJ iU 


rsFTTMRFRf^ 

IN UlVlDCIv^O ) 


NTJT T 


No 


No 




UJNiV UJJ ID 


]sjT TMRF 13 (Sd 


NTJT T 


No 


No 




JJJcJr AK 1 M-bJN 1 ID 


IN U 1V1oj_/1\.^o ) 


NTTT T 


No 


No 




TTNTV ID 




NULL 


No 


No 


. 


USERNAME 


VARCHAR2(30) 


NOT NULL 


No 


No 




PASSWORD 


VARCHAR2(30) 


NOT NULL 


No 


No 




TITLE 


VARCHAR2(100) 


NULL 


No 


No 


. '.ps 


FIRST NAME 


VARCHAR2(30) 


NULL 


No 


No 




LAST NAME 


VARCHAR2(30) 


NULL 


No 


No 


: :ser 5 


ADDRESS 1 


VARCHAR2(80) 


NULL 


No 


No 


m % 


ADDRESS2 


VARCHAR2(80) 


NULL 


No 


No 




ADDRESS3 


VARCHAR2(80) 


NULL 


No 


No 




CITY 


VARCHAR2(50) 


NULL 


No 


No 


§=*=:. 


STATE 


VARCHAR2(2) 


NULL 


No 


No 




ZIP 


VARCHAR2(10) 


NULL 


No 


No 




COUNTRY 


VARCHAR2(50) 


NULL 


No 


No 




PHONE 


VARCHAR2(15) 


NULL 


No 


No 




EMAIL 


VARCHAR2(80) 


NOT NULL 


No 


No 




CHALLENGE 


VARCHAR2(255) 


NOT NULL 


No 


No 




RESPONSE 


VARCHAR2(255) 


NOT NULL 


No 


No 




SECURITY 


CHAR(l) 


NOT NULL 


No 


No 




TTMESTAMP 


DATE 


NULL 


No 


No 




CBO ID 


VARCHAR2(64) 


NULL 


No 


No 



10 USER_CBOS Table 

The USER.CBOS table enables a user to have more than one active CBO at a time. 



Table Column 


Table Column 


Table 


Table 


Table 


Name 


Datatype 


Column Null 


Column 


Column 






Option 


Is PK 


Is FK 


USER ID 


NUMBER(8) 


NOT NULL 


Yes 


No 


CBO 


VARCHAR2(100) 


NULL 


No 


No 


TIMESTAMP 


DATE 


NULL 


No 


No 
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3. Input Interface 

An interface 8 is provided to the user for entering information to be stored in digital 
5 library 20. Information includes Program categories and prepublished content. The interface can 
be in a variety of forms, but it must be able to communicate with an 00 Api layer 30 which is in 
the present embodiment comprises a C dll. The interface 8 of the present embodiment is a web 
based solution consisting of 22, 24, 26 and 28. Alternatively, application code 28 may provide 
the same function. 

10 In the present example, each prepublished content product is input as one SGML file. 

The hierarchical levels within that SGML file are discernible by their delimiting tag types, 
Program information is provided as a field identifying the program for each prepublished content 
product. The program configuration information is defined in PSF/ATR files and loaded into the 
m datastore using the loader 10. 

jb 

O 4. Converter 

^ Converter 10 receives the SGML files and uses the delimiting tags to separate the product 

U entities and associated components. It also builds a file defining the hierarchical relationships of 
[S>0 these entities and components, and extracts relevant product attributes. In the present example, 
2 the resulting files include four possible file types: a Product Structure File (PSF), Attribute Files 
(ATR), Content Component Files and Associated Component Files. 

Product Structure Files (PSF). For content, the Product Structure File provides a 
hierarchical outline of the contents in a prepublished product. More specifically, it is a parsable 
25 formatted file listing all of the entities making up a content product (e.g., a book container, 
volume containers, chapter containers and sections, each identified by its sequence identifier). 
This file is used as a road map (i.e., a list or table of contents) defining the content, order and 
hierarchical structure of the prepublished product, thereby relating a product's separately stored 
content entities. It is stored as a part in digital library 20. An example of a PSF file for a content 
30 product is shown below: 
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PRODUCT.C:01 30808598.00.00.00 

FRONT_AND_BACK_ELEMENT:01 30808598.01 .01 .00 
FRONT_AND_BACK_ELEMENT:01 30808598.01 .02.00 
FRONT_AND_BACK_ELEMENT:01 30808598.01 .03.00 
5 FRONT_AND_BACK_ELEMENT:01 30808598.01 .04.00 

CHAPTER.C:01 30808598.02.00.00 

FRONT_AND_BACK_ELEMENT:01 30808598.02.01 .00 
SECTION:01 30808598.02.02.00 
SECTION:01 30808598.02.03.00 
10 SECTION:01 30808598.02.04.00 

SECTION:01 30808598.02.05.00 
SECTION:01 30808598.02.06.00 
FRONT_AND_BACK_ELEMENT:0130808598.02.07.00 
CHAPTER.C:01 30808598.03.00.00 
15 FRONT_AND_BACK_ELEMENT:01 30808598.03.01 .00 

SECTION:01 30808598.03.02.00 
SECTION:01 30808598.03.03.00 
SECTION:01 30808598.03.04.00 
SECTION:01 30808598.03.05.00 
.,20 SECTION:01 30808598.03.06.00 

°t FRONT_AND_BACK_ELEMENT:01 30808598.03.07.00 

"0 CHAPTER.C:01 30808598.04.00.00 

■£ FRONT_AND_BACK_ELEMENT:01 30808598.04.01 .00 

CO SECTION:01 30808598.04.02.00 

hfl25 SECTION:01 30808598.04.03.00 

rh SECTION:01 30808598.04.04.00 

SECTION:01 30808598.04.05.00 
f5 FRONT_AND_BACK_ELEMENT:01 30808598.04.06.00 

w ' CHAPTER.C:01 30808598.05.00.00 

1.30 FRONT_AND_BACK_ELEMENT:01 30808598.05.01 .00 

^ SECTION:01 30808598.05.02.00 

^ SECTION:01 30808598.05.03.00 

111 SECTION:01 30808598.05.04.00 

N= SECTION:01 30808598.05.05.00 

fi35 SECTION:01 30808598.05.06.00 

n FRONT_AND_BACK_ELEMENT:01 30808598.05.07.00 

CHAPTER.C:01 30808598.06.00.00 

FRONT_AND_BACK_ELEMENT:01 30808598.06.01 .00 
SECTION:01 30808598.06.02.00 
40 SECTION:01 30808598.06.03.00 

SECTION:01 30808598.06.04.00 
SECTION:01 30808598.06.05.00 
SECTION:01 30808598.06.06.00 
SECTION:01 30808598.06.07.00 
45 SECTION:01 30808598.06.08.00 

FRONT_AND_BACK_ELEMENT:0130808598.06.09.00 
CHAPTER.C:01 30808598.07.00.00 

FRONT_AND_BACK_ELEMENT:01 30808598.07.01 .00 
SECTION:01 30808598.07.02.00 
50 SECTION:01 30808598.07.03.00 

SECTION:01 30808598.07.04.00 
SECTION:01 30808598.07.05.00 
FRONT_AND_BACK_ELEMENT:0130808598.07.06.00 

CHAPTER.C:01 30808598.08.00.00 
55 FRONT_AND_BACK_ELEMENT:01 30808598.08.01 .00 

SECTION:01 30808598.08.02.00 
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SECTION:01 30808598.08.03.00 

FRONT_AND_BACK_ELEMENT:0130808598.08.04.00 
CHAPTER.C:01 30808598.09.00.00 

FRONT_AND_BACK_ELEMENT:01 30808598.09.01 .00 

SECTION:01 30808598.09.02.00 

FRONT_AND_BACK_ELEMENT:0130808598.09.03.00 
CHAPTER.C:01 30808598.1 0.00.00 

FRONT_AND_BACK_ELEMENT:01 30808598.1 0.01 .00 

SECTION:01 30808598.1 0.02.00 

SECTION:01 30808598.1 0.03.00 

FRONT_AND_BACK_ELEMENT:01 30808598.1 0.04.00 
CH APTER.C:01 30808598. 1 1 .00.00 

FRONT_AND_BACK_ELEMENT:01 30808598.1 1 .01 .00 

SECTION:01 30808598.1 1 .02.00 

SECTION:01 30808598.1 1.03.00 

FRONT_AND_BACK_ELEMENT:01 30808598.1 1 .04.00 
FRONT_AND_BACK_ELEMENT:01 30808598.1 2.01 .00 



For program categories, the PSF file contains the unique program identifier. As an example, the 
contents of a PSF file for the "Freshman Engineering" program is shown below: 



PROGRAM:fe 



Attribute Files (ATR). Attribute files contain metadata about each program or product 
entity input. This information must be extracted by converter 10. These files are mapped to the 
program and product index class defined by the Program and Product index classes (using the 
ELOADER.INI file described below) and stored in digital library 20. There is one attribute file 
for each program and for each product entity to be stored. Examples of ATR files are shown 
below. The first is an ATR file for a "book": 
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;; PRODUCT.C - ATR file - Created: 29 October 1999 21:55:06 

;; SeqJD: 0130808598.00.00.00 
5 ;; 

!SKU:0000000014595 

!ISBN:0130808598 

!Title:Engineering Success 

!Contrib_Group 
10 ! Contrib_First_Name: Peter 

! Contrib_Last_Name:Schiavone 

! Contrib_Affiliation:University of Alberta 

!PE_ID:FE 

!Status:0 
15 !Page_Count:0 

!Use_Actuals:1 

!Yr_of_Pub:1999 

!Edition:01 

!Revision:00 
,,20 !Version:01.00 
*t !Created_By:BARKER 
■|f !LastModified_By:BARKER 



3:25 The ATR for chapter 3 of the preceding book is shown below: 

111 ;; CHAPTER.C - ATR file - Created: 29 October 1 999 21 :55:09 
O30 ;; SeqJD: 01 30808598.03.00.00 
111 !SKU:0000000014618 

!*& !Title:lntroduction to Engineering and Engineering Study 

O lAuthored _Abstract:“How much do you know about engineering? Why did you choose to study 
K35 engineering?What reasons lead you to believe that you are ready and equipped to study 

engineering?What are the main differences between studying at a university and studying in high 
school?What new success skills do you need to succeed in engineering study?Can you write down 10 
answers to each question I have asked you? Go ahead and try.” 

!Authored_Abstract:This is often how I begin my lecture to freshman engineering students enrolled in an 
40 introductory engineering class. After a little thought, most of them realize just how little they know about 
this subject called engineering and (often despite excellent high school averages) how ill equipped they 
are to study engineering. 

!Authored_Abstract:ln this chapter, we address both issues. First, we ask the following questions:What is 
engineering?What do engineers do?Why choose to study engineering? 
45 !Authored_Abstract:The answers to these questions are not only interesting and informative, but will help 
keep you motivated along the long, hard road to an engineering degree. 

!Authored_Abstract:ln , we address the question, “Are you prepared and equipped for engineering 
study?” In doing so, we examine the study skills required to succeed in the university environment. 
For many students, the university is the next logical step after high school, the next academic challenge. 

50 Consequently, they expect their freshman year in engineering to be much like another year of high 
school-which, of course, it isn’t. In engineering, such an exception often manifests itself in 
unacceptably high first-year attrition rates. We address this issue by focusing on what you need to do to 
ensure the best possible start to earning your engineering degree. Essentially, you must develop the 
necessary:Work strategiesStudy strategiesAttitudesCommunication skillsAbility to work as part of a 

55 teamTime management skills 
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The ATR for section 3.2 of the same book is shown below: 



;; SECTION - ATR file - Created: 29 October 1999 21:55:09 
;; SeqJD: 0130808598.03.02.00 

!Filename:0130808598.03.02.00.sgm 
!CDAOID:AABQHDS0 
!lndex_Term:engineering 
!lndex_Term:defined 
!lndex_Term:engineering, study 
!lndex_Term:introduction to 
!lndex_Term:engineering, defined 
!Title:What Is Engineering? 
!SGML_Char_Cnt:2370 
!AC_Counts 

ACFORMID:2 

NUMBERAC:1 
!Associated_Component 
! AC_PE_ID:FE 
! AC_CDAOID:AABQHDT0 
! AC_Title:FiG1 
! AC_I mage_Type:TI FF 
! AC_Graphic_Filename:HiRes\AABQHDTO.TIF 
! AC_Authored„Abstract:None 
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The ATR file for the "Freshman Engineering" program is shown 



;Program ID for Freshman Engineering set to "FE" 
!PE_Program_ID:FE 
!PE_Title:Freshman Engineering 
!PE_Subtitle: 

!PE_Req_Count:ESOU002300 

; !PE_Related_Material:<vaIue> 

!PE_AC_Group 

! PE_AC_FormlD:1 

! PE_AC_ByteCount:2 

! PE_AC„FormDesc:lnline Graphic 

!PE_AC_Group 

! PE_AC_FormlD:2 

! PE_AC_ByteCount:1 000 

! PE_AC_FormDesc:Display Graphic 

!PE_AC_Group 

! PE_AC_FormlD:3 

! PE„AC_ByteCount:68 

! PE_AC„FormDesc: Inline Equation 

!PE_AC_Group 

! PE_AC_FormlD:4 

! PE_AC_ByteCount:180 

! PE_AC_FormDesc:Disp!ay Equation 

!PE„AC_Group 

! PE_AC_FormlD:5 

! PE_AC_ByteCount:2000 

! PE_AC_FormDesc:SGML 

!PE_AC„Avg_lmage„Bytes:0 

!PE_AC_Avg_SGML_Bytes:0 

!PE_Chars_Per_UPM_Tier:2000 

!PE_Price_Group 

! PE_Country:0 

! PE_Monetary_Unit:USD 

! PE_Min_Order_Price:1000 

! PE„Base_Cust_Pub_Price:1 000 

! PE_Base_UPM_Fee:0 

! PE_lncr_UPM_Fee:10 

;JDR add 1/21/99 

! PE_Source„Price_Per„Page:1 0 
! PE_Minimum_PageJ_imit:80 
! PE_Volume„Page_Limit:480 
! PE JJPM_Bytes_Per_Page:1 000 

!PE__Status:F 

!PE_CreateDate:1 998-1 2-07 
! P E_C reated By : U H AN A E D 
!PE_LastModifiedDate:1 999-1 -1 9 
!PEJ_astModifiedBy:llHANAED 



STL000023US1 



-36- 



Content Component Files (SGML). Content component files contain the product 
entities' actual ASCII or binary content that will be stored as parts in digital library 20. In the 
present example, these files comprise SGML files containing the ASCII text of chapter sections. 
Associated Component Files. Associated Component (AC) Files contain any 
5 non-SGML content associated with the product entities. The content in the associated 
component files is stored as parts in digital library 20. 

Both prepublished content and custom book outlines (CBO's), described below, are 
represented in the described file format. A feature of this format is that content objects such as a 
prepublished book or CBO are defined by the PSF file. Thus the PSF file may be used to 
10 redefine the content, order and structure of the content object without having to access the 

content itself. This feature proves useful in creating compilations of content, by simplifying the 
dp process for adding, moving and deleting content. 

S Composite PSF & ATR Files. Out of the PSF and ATR format comes a third file format 

5 that is a composite form of PSF and ATR. For simplicity, this type is also referred to as a 
Q15 PSF+ATR format. One can think of this file as a merge of PSF and ATR files, where attributes 
* 8 from an entity's ATR file have been inserted after that entity in the .PSF. For example, it may be 
O desirable to include certain attributes with a PSF file (e.g., author and price). It may be desirable 
V\l to add certain attributes to the product structure file (e.g., author) when it is stored in the digital 
^ library. Accordingly, in the present embodiment, what is stored as "Entity Structure Part" 
Q20 described earlier is actually a composite form of PSF and ATR. For a book or product level 

entity, this file includes all entities in the book (including the book itself) and their attributes. For 
a chapter level entity, this file includes all entities in the chapter (including the chapter itself) and 
their attributes. For a section level entity, this file includes the section entity and its attributes. 
Attributes are also added to PSF files containing custom compilation outlines created by system 
25 users, and Equery result files. In the Equery results files, all of the entities returned are treated 
flat, namely at the same hierarchical level. 

An example of a composite file format is shown below: 



STL000023US1 



Top_Entityl: sequence_ID 
! Attribute 1: value 
!Attribute2: value 
!Attribute3: 

! Subattribute 1 : value 
! Subattribute2: value 
[Attributed Value4 

Sub_Entityl: SequenceJD 
! Attribute 1: value 
!Attribute2: value 

Sub_Sub_Entityl: SequenceJD 
! Attribute 1: value 
!Attribute2: value 
Sub_Entity2: SequenceJD 
! Attribute 1: value 
!Attribute2: value 



An exemplaryentity structure part stored in Digital Library is provided in Appendix A. 

In the present example, converter 10 is preferably Active System's Attribute Extractor 
(i.e. AE). Converter 10 creates a load directory for each prepublished content product, identified 
by that product's ISBN, which contains the product's corresponding Product Structure File 
(PSF), Attribute Files, Content Component Files and Associated Component Files. It also 
creates a load directory for each program category, identified by the program identifier and 
containing the program's corresponding PSF and ATR files. These directories are provided as 
input to content loader 14. 

4. Content Loader 

Content loader 14 is a software application for loading the program and prepublished 
content files described above into the digital library 20. It receives the load directories as input 
from converter 10, then loads this information into digital library according to a content 
configuration model 12 defined in the ELOADER.INI configuration file (described below). 
Content loader 14 interfaces with the digital library content server(s) 18 through the 00 API 
layer 16. 

The content loader 14 has three modes of operation: load, delete and purge. 
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Load. The purpose of this mode of operation is to load or reload the Content Entities, 
Associated Components and Attributes into the digital library 20. All Content Component Files 
are stored as binary large objects orBLOBs in the digital library object server 48. All Attribute 
Files are parsed and the resultant parametric data is stored in the digital library server 44. 
5 As previously noted, the input files to the content loader 14 are a Product Structure File (i.e., a 

sequence-id.psf), an Attribute file for each product entity loaded (i.e., sequenceid.atr), a file for each 
Content Component (i.e., sequence-id.sgm) and a file for each Associated Component (i.e., 
sequenceid.cdaOID.gif) 

The output of the ELoader will be placed into the appropriate index class in the digital library 
10 20 as specified by the configuration model contained in the ELO ADER.INI file. 

; y Syntax: ELoader -load <sequence-id> 

$ Example #1: ELoader -load 012345678 

m This load command launches loader 14 into load mode. It looks in the load directory identified by an 

0 ISBN = "012345678" for all of the Attribute Hies, Content Components and Associated Components 

1 15 stored therein, and processes these files. 

Kj Example #2: ELoader 012345678.02.00.00 

O This load command launches loader 14 into load mode. It looks in the load directory identified by an 
ISBN = "012345678" for all Attribute Files, Content Components, and Associated Components 
associated with chapter container "012345678.02.00.00", and process these files. 
20 Delete. The purpose of this mode of operation is to delete selected Content Entities, Associated 

Components and Attributes from the Digital Library. The ELoader will delete all content, attributes, 
and text index entries from digital library 20 for the ISBN / sequence number specified, as well as all 
child content and attributes associated with that ISBN / sequence number. 

25 Syntax: ELoader --delete <sequence-id> 

Example #1: ELoader -delete 012345678 



STL000023US1 



This command launches loader 14 into delete mode and deletes all content and attributes for the 
prepublished content product whose ISBN = "012345678". 

Example #2: ELoader -delete 012345678.02.00.00 

This command launches loader 14 into delete mode and deletes all entities and attributes for the entity 
5 whose sequence number = "012345678.02.00.00" as well as all of its children. The rest of the content 

product remains untouched. 

Purge. The purpose of this mode of operation is to purge Content Entities, Associated 

Components and Attributes from the Digital Library after a Load that did not complete successfully. 

Loader 14 deletes all content, attributes, and text index entries from the digital library 20 even though 
0 it is in a partially loaded state. 

A Syntax: ELoader -purge <sequence-id> 

% Example: ELoader -purge 012345678 

Sr.?" 

i il 

r 15 This command launches loader 14 into purge mode and deletes all content and attributes for the 
j;f prepublished content product whose ISBN = "012345678". 

Q 5. Configuration Model 

Configuration model 12 is embodied in a configuration file called ELOADER.INI, and 
20 associated configuration files that it calls. The configuration files contain all of the switches and 
parameters necessary to customize the operation of loader 14 to the data model defined above 
The primary objective of these files is to minimize the need to change loader 14 program source 
code if the data model is modified. 

The ELOADER.INI file is organized into several sections with multiple keywords and 
25 values in each section. The LOGON and DEBUG sections describe parameters that govern the 
overall loader operation. The ELOADER section and the Individual Group Sections describe the 
entity types that have been defined in the exemplary data model (i.e., the Program, Product, CBO 
and Request entity groups). The Individual Entity Sections describe each entity type that belongs 
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to a given entity group. The ATTRIBUTES section and the Individual Attribute Sections 
describe the set of attributes that may be loaded for the entities in the data model. 

The ELOADER.INI file, the data model file, and each of the individual GROUP attribute 
files are in the same format as an Attribute file as shown in the examples. The GROUP file is in 
the PSF format. 

a. Structure 

LOGON Section. This section specifies the digital library USERID and server names for all 



operations between content loader 14 and digital library 20. 



KEYWORD 


VALUE 


MEANING 


LIBRARY 


name 


The name of the DIGITAL LIBRARY Library 
Server to be used. 


USERID 


name 


The USERID that will be used to logon to 
DIGITAL LIBRARY. 


PASSWORD 


name 


The PASSWORD of the USERID. 


TEXT SERVER 


name 


The name of the client instance of the Text Miner 
server. 


MAX HITS 


number 


The maximum number of hits to be returned by 
EQuery (described subsequently). 



DEBUG Section. This section specifies internal switch settings that are only used for 
debugging, testing, and performance analysis. 



KEYWORD 


VALUE 


MEANING 


TRACE 


0 
1 


No debug trace will be created. 

Create trace of internal activity for debugging. 

This is not a log file. 



Log files Section. This section specifies the names of the files to be used for logging and 
debugging. 



KEYWORD 


VALUE 


MEANING 


LOADER 


name 


Filename for Loader log. 


TRACE 


name 


Filename for debug trace. 
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ELOADER Section. This section specifies the name of the initialization file containing the full 
data model with all of its data groups. In other words, it is a pointer to a meta-metadata file. 



KEYWORD 


VALUE 


MEANING 


DATA MODEL 


Name 


Filename of a file containing each Group name 
and the name of the Group File. 


DEFAULT GROUP 


Name 


Name of the default Group. 


ROOT ENTITY SID 


Name 


String to be concatenated to the unique ISBN of a 
content object to obtain the root sequence-id. 



CONTENT CLASSES Section. This section specifies the digital library content class for each 
of the possible file extensions of associated component files. 



KEYWORD 


VALUE 


MEANING 


DEFAULT CONTENT 
CLASS 


Name 


BINARY if the component contains non-human 
readable data. 

ASCII if the component contains human readable 
data. 


File extension 


Name 


BINARY if the component contains non-human 
readable data. 

ASCII if the component contains human readable 
data. 



Groups File. This section lists the names of all hierarchical groups of entities within the data 
model. All entity instances that belong to the same group will be stored in the same set of digital 
library index classes. This is a convenient way to manage product-related entities separately 
from other business-related entities. There may be one or more GROUPS in a Groups File. 



KEYWORD 


VALUE 


MEANING 


GROUP 


name 


All instances of entities within this Entity group 
will be stored in the same set of digital library 
index classes. The Entity types that belong to this 
group may be specified via ENTITY keywords in 
an Individual Group Section. 
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Individual Group Attribute Files. Each filename in the group attribute files is the value of one 
of the GROUP keywords in the Groups File. It identifies the data model entities that will be 
stored together as a related group and the digital library index class names that will be used to 
store them. There is one Individual Group Section for each GROUP keyword in the Groups File. 





KEYWORD 


VALUE 


MEANING 




ENTITIES CLASS 


name 


The digital library index class name that will be 
used to store all instances of entities that belong 
to this group. 




ENTITY ID 


name 


The digital library attribute name in the 
ENTITIES Index Class for a unique identifier for 
the entitv It is assumed to be unique and an 
index. 


Ss:? 


ENTITY TYPE 


name 


The digital library attribute name in the 
FNTTTTFS Index Class for the digital library 
Type ID of the entity. 




ENTITY PARENT ID 


name 


The digital library attribute name in the 
FNTTTIES Index Class for the digital library Item 








ID of the parent container of this entity. 




ENTITY CHILD ID 


name 


The DIGITAL LIBRARY Attribute Name in the 
ENTITIES Index Class for the DIGITAL 
LIBRARY Item ID of the first child of this entity. 


sew 
=*:;. 


ENTITY SIBLING ID 


name 


The DIGITAL LIBRARY Attribute Name in the 
ENTITIES Index Class for the DIGITAL 
LIBRARY Item ID of the first sibling of this 

entitv 


1 :i 


ENTITY AUX ID 


name 


The DIGITAL LIBRARY Attribute Name in the 
ENTITIES Index Class for the first auxiliary 
attribute of this entitv. 




ENTITY SUBCOMP ID 


name 


The DIGITAL LIBRARY Attribute Name in the 
ENTITIES Index Class for the first associated 
component of this entity. 




A QQHP POMP ATTR 

NAME 


n amp 


The DIGITAL LIBRARY Attribute Name in the 
COMPONENTS Index Class for the DIGITAL 
LIBRARY Item ID of the ENTITIES Index Class 
item that owns the COMPONENT item. 




ASSOC COMP ID 
ATTk NAME 


name 


The DIGITAL LIBRARY Attribute Name in the 
COMPONENTS Index Class for the DIGITAL 
LIBRARY Item ID of the ENTITIES Index Class 
item that owns the COMPONENT item. 




ATTRIBUTES CLASS 


name 


The DIGITAL LIBRARY Index Class name that 
will be used to store all attributes that are 
hierarchical or have multiple instances. 




ATTR ENTITY ID 


name 


The DIGITAL LIBRARY Attribute Name in the 
ATTRIBUTES Index Class for the DIGITAL 
LIBRARY Item ID of the ENTITIES Index Class 
item that owns the ATTRIBUTE item. 




ATTR PARENT ID 


name 


The DIGITAL LIBRARY Attribute Name in the 
ATTRIBUTES Index Class for the DIGITAL 
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LIBRARY Item ID of the parent container of this 
entity. 


ATTR CHILD ID 


name 


The DIGITAL LIBRARY Attribute Name in the 
ATTRIBUTES Index Class for the DIGITAL 
LIBRARY Item ID of the first child container of 
this entity. 


ATTR SIBLING ID 


name 


The DIGITAL LIBRARY Attribute Name in the 
ATTRIBUTES Index Class for the DIGITAL 
T TBRARY Item ID of the next sibling container 
to this entity. 


ATTR NEXT VALUE 


name 


The DIGITAL LIBRARY Attribute Name in the 
ATTRIBUTES Index Class for the DIGITAL 
LIBRARY Item ID of the next value of this 
entity. 


Al IK Kh,Y WUKJJ 


HoIHC 


The DIGITAL LIBRARY Attribute Name in the 
ATTRIBUTES Index Class for the attribute 
containing the Attribute's Keyword. 


ATTR VALUE 


name 


The DIGITAL LIBRARY Attribute Name in the 
ATTRIBUTES Index Class for the attribute 
containing the Attribute's Value. 


ENTITY 


name 


Name of the entity type of the root entity. 


ENTITY STRUCTURE 
PART 


Part 

Number 


Specifies the DIGITAL LIBRARY Part Number 
where an internally generated summary of 
attribute values for this entity and all of its 
descendants will be stored. 



ATTRIBUTE Definitions. Each top-level attribute name that may be present in a Group 
Attribute file must have keywords defined. Attributes that are part of an attribute hierarchy (i. 
sibling attributes with a parent attribute) are defined by an ATTRIBUTE keyword within the 
parent's attribute definition. 



KEYWORD 


VALUE 


MEANING 


ATTRIBUTE 




Defines the beginning of a single attribute. There 
may be one or more ATTRIBUTE definitions in 
Group File. Each attribute name that may be 
present in an attribute file must have keywords 
defined. 
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Attribute Keywords 





KEYWORD 


VALUE 


MEANING 




NAME 


name 


The attribute name that will be used to identify 
this attribute. 


■list? 

to 


TYPE 


ENTITY 

COMP 

AUX 

SYS 


The value of this attribute will be stored as a 
Primary attribute in the Entities Index Class of 
the appropriate Entity Group. 
It uses the DIGITAL LIBRARY attribute 
specified by the DIGITAL LIBRARY NAME 
keyword. The value of this attribute will be 
stored as a Primary attribute in the Components 
Index Class of the appropriate Entity Group. 
It uses the DIGITAL LIBRARY attribute 
specified by the DIGITAL LIBRARY NAME 
keyword. 

The value of this attribute will be stored as an 
Auxiliary attribute along with its keyword. 
Depending on its position in the attribute file, it 
will also contain the DIGITAL LIBRARY Item 
ID of either an Entities Index Class or a 
Components Index Class item of the appropriate 
Entity Group. 

The value of this attribute may not be loaded via 
ELoader and it is not explicitly stored in the 
Digital Library. The value or this attribute 
generated by the DIGITAL LIBRARY query 
engine and is available for output by EQuery. 




VALUES 


1 (default) 


This attribute may only have one value. The 
attribute type may be either PRIM or COMP. 
This attribute may have zero or more values. The 
attribute type may only be AUX. The values will 
always be retrieved in the same order that they 
are stored. 




DEFAULT 


INHERIT 

LINK 

NULL 


If a value is not explicitly specified for this 

attribute, the current value of the same attribute 

of the parent entity is used when the entity is 

stored (i.e. early binding). 

If a value is not explicitly specified for this 

attribute, the current value of the same attribute 

of the parent entity is used when the entity is 

retrieved (i.e. late binding). 

If a value is not explicitly speciiiea ior mis 

attribute, the value is assumed to be a NULL 

ctrincr (\ p no binding 




FILE 


0 (default) 
1 


This attribute has a normal text value and is not a 
file name. The value of this attribute is a file 
name. In addition to storing the file name as the 
attribute value, the content of the file is stored as 
a DIGITAL LIBRARY part in the Part number 
specified by the PART keyword. 




PART 


n 


If the PART keyword is specified, the value of 
the attribute is either a long string or the contents 
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of a file (based on the value of the FILE 
keyword). The value of the PART keyword 
specifies the DIGITAL LIBRARY Part Number 
where value of attribute will be stored. The part 
will be stored with the item that contains the 
attribute value. This type of attribute may be 
searched with Text search, but not parametric 
search. 


DL NAME 


name 


The DIGITAL LIBRARY attribute name that will 
be used to store this attribute. The attribute type 
must be ENTITY or COMP. 


SEARCH 


P 
T 
PT 


Allow parametric search. The attribute type must 
be ENTITY. 
Allow text search. 

Allow parametric and/or text search. The attribute 
type must be ENTITY. 


TEXT INDEX 


name 


The name of the Text Miner index that will be 
used to index this attribute value. 


ATTRIBUTE 




The presence of an ATTRIBUTE keyword 
indicates that the parent attribute has child values. 
The top-level attribute type must be AUX. The 
value of this attribute is the aggregation of all of 
the values of the attributes that are defined by all 
of the attribute values that it contains. 
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Example of an ELOADER.INI File: 



ILOGON 

! LIBRARY:LIBSRVRX 
! USERID:chuck 
! PASSWORDxhuck 
! TEXT SERVER:TM 

'.DEBUG 

! TRACE: 1 

!LOG FILES 

! LOADER:Emissary.log 
! TRACE:ETrace.log 

IELOADER 

! DATA MODEL:EMISSARY.GROUPS 
! DEFAULT GROUP:PRODUCTS 
! ROOT ENTITY SID: .00.00.00 

ICONTENT CLASSES 

! DEFAULT CONTENT CL AS S :BIN ARY 

! tiff:BINARY 

! gif:BINARY 

! jpg:BINARY 

! eps:ASCII 

! sgm:ASCII 

! txt:ASCE 
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GROUPPRODUCTS 

GROUP: ProgramGroup 

ProgramGroup.ATR file for the group ProgramGroup: 

! NAME: PROG RAM 

; Index Class control words 

! ENTITIES CLASS:E_Program 

! ENTITY ID:E_SeqlD 

! ENTITY TYPE:E_EntityType 

! ENTITY PARENT ID:S_Parentltem 

! ENTITY CHILD ID:S_Childltem 

! ENTITY SIBLING ID:S_Siblingltem 

! ENTITY AUX ID:S_Auxltem 

! ENTITY STRUCTURE PART:9 

! ASSOC COMP ATTR NAME:Associated_Component 
IASSOC COMP ID ATTR NAME:AC_CDAOID 

! ATTRIBUTES CLASS:E_ProgramAux 
IATTR SEQUENCE ID:E_SeqlD 
IATTR ENTITY ID:S_Programltem 
IATTR PARENT ID:S_Parentltem 
IATTR CHILD ID:S_Childltem 
IATTR SIBLING ID:S_Siblingltem 
IATTR NEXT VALUE:S_NextValueltem 
IATTR KEYWORD:S_Keyword 
IATTR VALUE:S_Value 

;// Data Model "Entity types" 
!ENTITY:PROGRAM 

;// System attributes 
[ATTRIBUTE 

! NAME:Associated_Component 

!TYPE:SYS 

IATTRIBUTE 

!NAME:AC_CDAOID 

!TYPE:SYS 

IATTRIBUTE 

!NAME:Hits 

!TYPE:SYS 

IATTRIBUTE 

!NAME:HitWords 

!TYPE:SYS 

IATTRIBUTE 

!NAME:Rank 

!TYPE:SYS 

IDL NAME:DKRANK 

;// Data Model "Entity attributes" 
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ffl 

IJ1 

;:,?o 



40 



45 



50 



55 
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ATTRIBUTE 

! NAME:PE_Program_ID 

!TYPE:ENTITY 

!DL NAME:E_ProgramlD 

ATTRIBUTE 

!NAME:PE_Title 

!TYPE:ENTITY 

!DL NAME:E_Title 

ATTRIBUTE 

!NAME:PE_Subtitle 

!TYPE:ENTITY 

!DL NAME:E_Subtitle 

ATTRIBUTE 

! NAME:PE_AC_Avg_lmage_Bytes 
!TYPE:ENTITY 

!DL NAME:E_AvgChrPerlmage 
IATTRIBUTE 

! NAME:PE_AC_Avg_SGML_Bytes 
!TYPE:ENTITY 

!DL NAME:E_AvgChrPerSGMLAC 
IATTRIBUTE 

I NAME:PE_Chars_Per_UPM_Tier 
!TYPE:ENTITY 

!DL NAME:E_MaxChrPerUpmTier 

IATTRIBUTE 

I NAME:PE_Req_Count 

!TYPE:AUX 

IVALUES:* 

ATTRIBUTE 

I NAME:PE_Related_Material 

!TYPE:AUX 

IVALUES:* 

ATTRIBUTE 

!NAME:PE_AC_Group 

!TYPE:AUX 

IVALUES:* 

IATTRIBUTE 

! NAME:PE_AC_FormlD 
! TYPEAUX 
IATTRIBUTE 

! NAME:PE_AC_ByteCount 
! TYPE:AUX 
IATTRIBUTE 

! NAME:PE_AC_FormDesc 

! TYPEAUX 

IATTRIBUTE 

! NAME:PE_Price_Group 

!TYPE:AUX 

IVALUES:* 

IATTRIBUTE 

! NAME:PE_Country 
! TYPE:AUX 
IATTRIBUTE 

I NAME:PE_Monetary_Unit 
! TYPEAUX 
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15 



Uo 



ii:335 



40 



45 



ATTRIBUTE 

NAME:PE_Min_Order_Price 

TYPE:AUX 
ATTRIBUTE 

NAME:PE_Base_Cust_Pub_Price 

TYPE:AUX 
ATTRIBUTE 

NAME:PE_Base_UPM_Fee 

TYPE:AUX 
ATTRIBUTE 

NAME:PE_lncr_UPM_Fee 

TYPE:AUX 
ATTRIBUTE 

NAME:PE_Source_Price_Per_Page 

TYPE:AUX 
ATTRIBUTE 

NAME:PE_UPM_Bytes_Per_Page 

TYPE:AUX 
ATTRIBUTE 

NAME:PE_Minimum_Page_l_imit 

TYPE:AUX 
ATTRIBUTE 

NAME:PE_Volume_Page_Limit 

TYPE:AUX 
ATTRIBUTE 

NAME:PE_Status 

TYPE:ENTITY 

DL NAME:E_Status 
ATTRIBUTE 

NAME:PE_CreateDate 

TYPE:ENTITY 

DL NAME:E_CreateDate 
ATTRIBUTE 

NAME:PE_CreatedBy 

TYPE:ENTITY 

DL NAME:E_CreatedBy 
ATTRIBUTE 

NAME:PE_LastModifiedDate 

TYPE:ENTITY 

DL NAME:E_LastModifiedDate 
ATTRIBUTE 

NAME:PE_LastModifiedBy 

TYPE:ENTITY 

DL NAME:E_LastModifiedBy 



STL000023US1 



-50- 



PRODUCTS.ATR file for the group PRODUCTS: 



; Index class info for Products 
[ENTITIES CLASS:tmpResource2 
IENTITY ID:SeqID 
5 IENTITY TYPE:EntityType 

IENTITY PARENT ID:ContainerItem 
IENTITY CHILD ID:ChildItem 
IENTITY SIBLING ID:SiblingItem 
IENTITY SUBCOMP ID:SubcompItem 
10 IENTITY AUX ID:AuxItem 

I ASSOC COMP ATTR NAME:Associated_Component 
I ASSOC COMP ID ATTR NAME:OID 



15 IATTRIBUTES CLASS :tmpAux2 

I ATTR ENTITY IDrEResourceltem 
I ATTR PARENT ID:ContainerItem 
I ATTR CHILD ID:ChildItem 
IATTR SIBLING ID: Siblingltem 
i|20 IATTR NEXT VALUE:NextValueItem 

p IATTR KEYWORD:EKeyword 

Ul IATTR VALUE:EValue 

P ; Data Model Root Entity Types for Products Group 

[^25 !ENTITY:Product.c 

l£ ; Data Model Attributes for Products - i.e., mapping of metadata properties 

; from PSF attribute files to data locations in the DL data repository 

ATTRIBUTE 

NAME:TITLE 
TYPE:ENTITY 
DL NAME:Title 

ATTRIBUTE 

NAME:PRICE 
TYPE:ENTITY 
DEFAULT:INHERIT 
DL NAME:CharCount 



:ssk:. 



30 



35 



40 



45 



IATTRIBUTE 

NAME:FTLE NAME 
TYPE:ENTITY 
DL NAME:ContentFileName 
FILE:1 
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PART: 1 

ATTRIBUTE 

NAME:INDEX TERM 

TYPErAUX 

PART:5 

TEXT INDEX:TDCTERM 

ATTRIBUTE 

NAME:ITEM INDEX 

TYPE:AUX 

PART:5 

TEXT INDEX:TTXTERM 

ATTRIBUTE 

! NAME: AUTHOR 

! TYPE:AUX 

! DEFAULT:INHERIT 

! ATTRIBUTE 

! NAME:NAME 

! TYPE:AUX 

! PART:6 

! TEXT INDEX :TDCTERM 



! ATTRIBUTE 

! NAME:SCHOOL 

! TYPE:AUX 

! ATTRIBUTE 

! NAME:Associated_Component 

! TYPE:COMP 

! ATTRIBUTE 

! NAME:OID 

! TYPE:COMP 

! DL NAME:ObjectId 

! ATTRIBUTE 
! NAME:TITLE 
! TYPE:COMP 
! DL NAME:Title 

! ATTRIBUTE 
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NAME:SIZE 
TYPE:AUX 



ATTRIBUTE 

NAME:COMP FILE NAME 
TYPE:COMP 

DL NAME:ContentFileName 

FILE:1 

PART:1 



10 




m 

01 

ill 
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B. Selecting Content for a Compilation of Content 

The selection path for creating a compilation of content is shown in Fig. 6. This path 
allows a user to interface with the digital library 20 to retrieve and view content objects stored 
5 therein, select objects for inclusion in a compilation of content, create new objects for inclusion 
in the compilation and for storage in the digital library 20, and submit the completed compilation 
for approval. 

In brief, block 22 represents a user interface application 22 which preferably runs within a 
standard web browser. It comprises HTML and Javascript applications that provide a user 
10 interface and some amount of application function such as searching, viewing, selecting, 

creating, editing, and organizing content accessed from the content server(s). The user creates a 
Q compilation in the form of a custom content outline (CCO), which is essentially a formatted text 
"f~ document that includes pointers to the actual content to be included in the final compilation. In 
the current example for creating custom textbooks, the CCO is called a custom book outline or 

ms cbo. 

■ s £ The user-interface application 22 communicates through a web server 26 to an 

^ application layer 28. Application layer 28 preferably comprises a set of PERL applications that 
N control some user interface transactions (e.g., login procedures), retrieve data for presentation to 
L*L' the user, perform CCO manipulation and submission, and forward commands to the API Layer 
*^20 30 to communicate actions requested by the user. 

Application layer 28 accesses the content server(s) 18 via API layer 30. The API layer 30 
preferably consists of a collection of C++ routines that perform discrete functions such as the 
actual CCO manipulation functions and digital library 20 functions (e.g., search and retrieve). It 
also includes a PERL/C++ glue layer between the C++ routines and application layer 28 for 
25 bridging parameter lists between C++ and PERL. The API layer 30 is provided to map digital 
library 20 more closely to the customer's website and application program workflow. 
Underneath, this API 30 makes use of the digital library API 16 to query/update/delete and 
retrieve data from digital library 20. 
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1. Custom Book Outline 

Prior to submission, a custom book outline is preferably an abstract representation of the 
compilation of content being created. For example, the CBO may be a hierarchical outline of the 
contents to be included in a compilation of content. At this point, it contains only references to 
the actual content to be included in the final work. This format is more efficient than pulling in 
content at creation time because it avoids retrieval and manipulation of large BLOBs of 
information until the CBO is in its final form. 

In the present example, the CBO at creation time is a formatted text document comprising 
a parsable formatted file like the "PSF" filetype previously described. Like the content product 
PSF files, the CBO is merely an outline with references to the content entities to be included 
therein. One difference is that a CBO may be a composite PSF+ATR filetype, including 
attributes particular to the CBO such as author and price. This is the case in the present 
embodiment. 

"Entities" once again refers to the content hierarchy definitioa For example, the 
hierarchy definition of a textbook includes containers representing the higher levels of the 
hierarchy (cbo.c, volume.c and chapter.c). The smallest entity of the hierarchy is a section. 
Each entity in the CBO is represented by a sequence ID in the same format as previously 
described with reference to product entities. The sequence ID of a container entity is used to 
identify all subentities of that container. The sequence ID's of a leaf node is used to reference the 
actual content associated with that node. 

A CBO according to the present example is stored as a digital library part Its attributes 
are also contained in a row of a relational CBO index class defined by the Custom Book Outline 
Index Class, and its unique identifier for this particular implementation is stored in the User 
Table, although it could also be stored in the CBO index class The User Table contains this 
reference for the purpose of identifying the current CBO a user is working with. This allows for 
the user to log off and log back in and return to the previous CBO "work in progress". The row 
in the CBO index class includes references to the CBO part number, as well as any associated 
parts. 

Fig. 7 depicts a row 82 representing a CBO. It includes a CBO identifier, CBO attributes, 
and pointers to one or more PSF files or "parts" associated with the CBO. The first part contains 
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the parsable formatted text outline representing the compilation of content, which in turn 
includes references to actual product content making up the CBO. A second part comprises a 
backup downlevel copy of the first part that is used to undo previous transactions. A third part, 
designated with the number 50 or higher, represents any user-provided content that has been 
5 added to the CBO. Each part of this type includes pointers to the actual user-provided content, 
which is stored in digital library 20. 

An example of a CBO is shown below. 



CBO.C:OW1 T8$U EB4H3 @ S E7 
10 !PE_ID:FE 

!Title:Student Loans 

!Userid:DaveBaer 

!Undoable:FALSE 
^ !Product_Type:CBO 
'Si 5 !Create_Date:20001 209203630 
ti !Last_Modif ied_Date:20001 21 41 1 361 5 
J: !Status:0 

m !CBO_State_Changed_Date:20001 209203630 

* !UPM_Terms_And_Conditions_Date:20001 214 11 :36:1 3 

0 C20 [Acknowledgement: 

O !Contrib_Group: 

!Price:2216 
, !UP!VLPrice:1000 
s*s !Page_Count:21.8 
225 !Char_Count:186 
Ci. !Nextchapter:2 
I y !ECtlSGMLChrPerPage:3800 
|sfe !ECtlAvgChrPerlmage:0 
Q !ECtlAvgChrPerSMGLAC:0 
O30 !ECtlMaxChrPerUPMTier:2000 

!ECtlSourcePricePerPage:1 0 

!ECtlUPMBasePrice:1000 

!ECtlUPMIncrPrice:10 

!Country_Code:US 
35 !PE„Volume_Page__Limit:480 

!PE_Minimum_Page_Limit:80 

!PE_Min_Order_Price:1 000 

!UPM_Bytes_Per_Page:1 000 

!Base_Cost:1000 
40 VOLUME.C:V1 
!UPM_price:0 

!Title:My New ESource Book created on 12/09/2000 at 20:36:28 Volume Number 1 
!Price:216 
!Product_Type: 
45 !Publication_Media_Type: 
!Page_Count:21.8 

FRONT_AND_BACK_ELEMENT: 
!Title:Table of Contents 
!Page_Count:6 
50 !Price:60 
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FRONT_AND_BACK_ELEMENT: 

'Title: Preface 

!Page_Count:9 

!Price:90 

CHAPTER.C:C1 

!Title:New Chapter 

!Price:16 

!Page„Count:1.8 

SECTION:01 37842244.02.02.00 

ITitle: Background Ideas 

!SGML_Char_Cnt:2111 

!PE_ID:FE 

!Page_Count:0.6 

!lnfo_Generated:1 

!Price:6 

SECTION:01 37842244.02.03.00 

!Title:Why Study Engineering Ethics? 

!SGML_Char_Cnt:3905 

!PE_ID:FE 

!Page_Count:1.0 

!lnfo_Generated:1 

! Price: 10 

UPM SECTION:50 

!Title:My New UPM Title 

!SGML_Char_Cnt:186 

!AC_Subdoc_Cnt:0 

!AC_lmage_Cnt:0 

!Page_Count:0.2 

!Price:0 

FRONTAND_BACK_ELEMENT: 

!Title:lndex 

!Page_Count:5 

!Price:50 



Attributes are identified by the "!" character. The first line contains a 16 byte CBOJD, i.e., 
This is the itemid defined previously in the discussion of the digital library 20. The CBOJD is 
the unique identifier for each row in the CustomBookOutline index class. 

The outlined structure is primarily defined by lines below the Volume.C container 
definition and includes the title of the book, a table of contents, a chapter container, two product 
sections and their corresponding sequence identifiers, and a user provided section and its 
corresponding part number (UPM SECTION: 50). The text above the Volume.C is defining 
author, pricing, and other desired variables. This format is parsed by the Edit.cgi script within 
application layer 28 and displayed to the user on a web interface 22 screen. 

A CBO according to the present example is presented to the system user through the web 
user interface 22 as will be described subsequently. 
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2. Application Layer and API Layer 

Application layer 28 and API layer 30interoperate to execute a group of functions 
defined to enable creation, submission and request processing of a CBO. The functions are listed 
5 below and described subsequently with reference to specific CBO tasks. Each function is a 
logical procedure, preferably implemented in C++, that defines a set of function rules and 
includes calls to digital library functions necessary to perform the function (e.g. Queries, writes, 
etc.). In other words, each function is an encapsulation of a series of oft-repeated functional 
steps which facilitates the creation and processing of a CBO. Digital library function calls are 
10 made through the digital library Client OO API 42. A definition of digital library 20 OO API 
function calls utilized by API layer 30 is provided in the "IBM Digital Library Programming 
Reference Version 2", Second Edition (September 1997), and rules for making these function 
fp calls are provided in the "IBM Digital Library Guide to Object-Oriented and Internet Application 
■S Programming Version 2", Second Edition (September 1997), both available from IBM 
015 Corporation. 

s Each function in API layer 30 has a corresponding function in application layer 28. In the 

^ present embodiment, the application layer functions are implemented as Perl scripts. These 
ill scripts receive Perl input parameters from Perl function calls residing in the application layer 28. 

The Perl scripts translate the Perl arguments received into equivalent C++ parameters and then 
M20 issue calls to their corresponding C++ functions in API layer 30. The API layer functions then 
execute the actual function steps and return C++ output parameters to their corresponding Perl 
functions. The Perl functions then translate the C++ arguments into equivalent Perl arguments. 
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CBO Functions 

ECBOCreate 

ECBOCIone 

ECBOAddContent 

ECBONewContainer 

ECBORemoveContent 

ECBODefete 

ECBOMoveContent 

ECBORelocateContent 

ECBOSetTitle 

ECBOGet 

ECBOList 

ECBOListByCreator 

ECBOUndo 

ECBOSetState 

ECBOAddUPM 

ECBOGetUPM 

ECBOSetTerms 

ECBOSubmit 

CBOCheckPolicies 

Prepublished Content Product Support Functions 

EProductGetOutline 

EProductSetState 

EGetHTML 

Miscellaneous Support Functions 

EAdminPopulatePageAndPrice 

EDLAuthCheck 

EGetProgramAttributes 

It is worthwhile noting at this point that the API layer 30 also includes Request functions. 
In the present embodiment the Request functions (described with reference to the CBO approval 
path, below), are called by scripts residing in Request Generator 32 rather than application layer 
28. However, this architecture is a matter of design choice and the skilled artisan will appreciate 
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that the Request function calls may be contained in the application layer 28 without departing 
from the essence of the invention. 

Request Processing Support Functions 

5 

ErequestUpdateStatus 
ErequestAssignlSBN 
ErequestList 
ErequestGet 
1 0 ErequestExistsforCBO 
ErequestGetForCBO 

O 3. Creating a Custom Book Outline 

if 15 A user creates a custom book outline by selecting content stored in the data repository 20 

4f through a web-based user interface 22. The user is presented with input screens that lead him 
yl through registration, system login, and CBO creation, manipulation and submission. The format 
I ji and content of the screens is defined by HTML passed to the web browser 26 from functional 
:L modules residing in application layer 28 that are invoked by the user's actions. The format may 
N20 be changed if the system is modified for different purposes. Javascripts in the web browser 26 
LI accept the HTML and use it to build web pages according to known methods. The Javascripts 
;::f add static elements such as vertical and horizontal navigation bars. 

The sequence of events for creating a custom book outline, including a description of the 
user interface input screens and their underlying functional modules, will now be described. 

25 

Login/Create New Account 

A user entering the system of the present invention is first presented with a login screen 
as shown in Fig. 8A. The format and content of the login screen are defined by a login 
30 procedure. The screen includes input fields 104, 106 for receiving a username and password. 

If the user is new to the system, he must first register for a username by clicking on the 
register button 102. Clicking on register button invokes a registration procedure and brings up a 
new screen as shown in Fig. 8B containing a plurality of input fields. The user is prompted to 
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enter a desired username and other information that will be stored by the system in connection 
with the user and may later be used to configure the CBO user interface. In the present example, 
this information is tailored to a university professor who will be creating a textbook in a 
particular discipline and includes, for example, his name, address, university, department, 
5 challenge question and secret response. It includes most of the information listed in the USERS 
Table data model. 

Once the user's information is entered, the user clicks on button 1 10. The information is 
retrieved and forwarded with additional information such as a timestamp for storage in an 
account database. The account database may be the relational database 46 associated with library 
10 server 44, for example. Alternatively, an auxiliary database may be provided that is linked 

directly to the application layer 28. In the present example, an auxiliary database was used. The 
1 0 database stores the information according to the template provided by the USERS Table data 
rp model. 

:J|f A confirmation screen is displayed next as shown in Fig. 8C with the userid and a 

Ol5 temporary password. The user may now log into the system by clicking on the login button 1 12. 

Ill 

This action causes the user to be returned to the login screen of Fig. 8 A and control to return to 

the login procedure. The user then enters his new id and password. 
Ill New and returning users complete the login process by clicking on button 108. This 

h action causes a query to be issued to the account database to determine if the username and 
lsJ 20 password match the USERJD and PASSWORD columns of any rows in the USERS Table. If 

there is no match, an error message is returned to the user and he is prompted to reenter his ID. If 

there is a match, the user information is retrieved. This information includes an active CBO_ID. 

For returning users, it represents the last CBO that the user was working on. 

In the case of a new user logging onto the system for the first time, theECBOCreate, 
25 EDLAuthCheck and ECBOSetState functions described below are invoked. 
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ECBOCreate 

Description 

This function creates a new, skeleton CBO, populates whatever data that it knows at this time, and 
then stores it into the Digital Library. This procedure is invoked whenever a user: 

1 . logs into the web site for the first time 

2. clears an active CBO 

3. adds a new CBO to his list of CBO's 



Functional Processing 

1 . Verify that the PROGRAM ID passed is valid. 

2. Create a new CBO with the user's userid and generate a temporary title. The CBO is a parsable formatted 
text (i.e., PSF) file. 

3. Mark the CBO status as active (CBO Status = "1 ") with a call to ECBOSetState. 

4. Extract the Program variables related to pricing and virtual page count , storing them in the CBO (described 
subsequently). 

5. Write the CBO to the digital library 20. Storing a copy of the CBO in the client cache is automatically 
handled by the digital library client 16. 

6. Parameter List 



Inputs 


Username 




Password 




Hostname 




Program id 




Userid 




Title 




Distribution country 




Monetary unit 




Composition set 


Outputs 


CBO id 



Index Classes 

• CustomBookOutline 

• Program 

• ProgramAux 
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EDLAuthCheck 

Description 

Before performing any operation, make sure the system-provided datastore login parameters are 
valid. 

Functional Processing 

I. Verify that the datastore username, password and hostname passed are valid by connecting to the DL 
datastore. 

II. Disconnect from the digital library datastore. 
Parameter List 



Inputs 


DSUsername 
DSPassword 
DSHostname 


Outputs 




Errors 


RCJDS_NOCONNECTION 

RC_DS_NOTAUTHORIZED 

RC_SYSTEM_ERROR 



Index Classes 
None 

ECBOSetState 

Description 

This function changes the approval status of a CBO as it moves through the workflow from 
creation, to submission, to approval/return/rejection, to printing through the back-end custom 
publishing process. 

Functional Processing 

I. Verify that the CBO id passed is valid. 

II. Verify that the CBO status passed is valid ("1 "2", "3", or "4"). 
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III. Update the CBO status in the active CBO. 

IV. Remove all Order Requests for the active CBO if the CBO status is being set to Active. 
Parameter List 



Inputs 


DSUsemame 

DSPassword 

DSHostname 

Userid 

CBOid 

CBO state 


Outputs 




Errors 


RC_SYSTEM_ERROR 



Index Classes 

• CustomBookOutline 

• Request 
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An new CBO PSF file will look like this: 

CBO.C:OW1T8$UEB4H3@SE7 
!PE_ID:FE 

5 !Title:My New ESource Book created on 1 2/09/2000 at 20:36:28 
!Userid:DaveBaer 
!Undoable:FALSE 
!Product_Type:CBO 
!Create_Date:20001 209203630 
10 !Last„Modified_Date:20001 209203630 
!Status:0 

!CBO_State_Changed_Date:20001 209203630 

!UPM_Terms_And_Conditions_Date: 

lAcknowledgement: 
15 !Contrib_Group: 

!Price:1000 

!UPM_Price:0 

!Page_Count:0 

!Char_Count:0 
: f 120 !Nextchapter:1 
| !ECtlSGMLChrPerPage:3800 
f !ECtlAvgChrPerlmage:0 
|i !ECtlAvgChrPerSMGLAC:0 
t !ECtlMaxChrPerUPMTier:2000 
^25 !ECtlSourcePricePerPage:10 
f !ECtlUPMBasePrice:1000 
J !ECtlUPMIncrPrice:10 
ll !Country_Code:US 

!PE„Volume_Page_Limit:480 
330 !PE_Minimum_Page__Limit:80 
1 !PEJMin_Order_Price:1000 
*i !UPM_Bytes_Per_Page:1000 
T" !Base_Cost:1000 
Z VOLUME.C:V1 
:!f35 !UPM_price:0 

^ !Title:My New ESource Book created on 12/09/2000 at 20:36:28 Volume Number 1 

!Price:0 

!Product_Type: 
!Publication_Media_Type: 
40 !Page_Count:0 



In addition, a cookie is created and stored in the web browser's memory including the 
user's name, userid, password and active CBO_ID. This cookie is used to establish connections 
45 to the content management system as necessary and transparently to the user, according to 
conventional methods. 

A welcome screen is presented to the new user by a MYBOOKOUTLINE procedure, as 
shown in Fig. 9A. A first section 1 1 1 identifies the user and information about the user - in this 
case her university name, department and guest access level. A second section 113 lists the 
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user's new custom book outline. The list includes a "My Custom Book" column containing a 
temporary title and creation date, a "Last Date" column reflecting the CBO' s last date of 
modification, a "Status" column indicating that the CBO is active, and a "Request ID" column 
that is null at this point to indicate that the CBO has not been submitted for approval. 
5 The user now has two options: 1) he may select the new CBO for modification by 

clicking on the temporary CBO title, which is a hypertext link to the new CBO.; or 2) he may 
create another CBO by clicking on CBO button 118. Button 118 once again invckes the 
ECBOCreate procedure. Clicking on the CBO title causes the MYBOOKOUTLINE procedure 
to issue a call to the ECBOGet procedure to retrieve the CBO formatted text from the digital 
10 library 20. It then parses the text and displays the empty CBO to the user as shown in Fig. 10A. 
The ECBOGet procedure is described below. 

CO ECBOGet 

ifi Description 

Q15 

IJl The content of the custom book outline / CBO is returned. 

!*? Functional Processing 

[Jf 1. Verify that the CBO id passed is valid. 
J:rf 20 2. Retrieve content of the CBO. 

3. Add Preface and Table of Contents front matter elements to the very beginning of the CBO: 

4. Add the Index back matter element to the very end of the CBO: 

5. Pricing and virtual page counts are pre-determined for these front or back matter elements and are 
hard-coded at this time. The attributes are adjusted for the affected Volumes as well as the CBO totals. 

25 Parameter List 



Inputs 


DSUsername 

DSPassword 

DSHostname 

Userid 

CBOid 


Outputs 


CBO description 


Errors 


RC_SYSTEM_ERROR 
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Index Classes 

• CustomBookOutline 

5 

The custom book outline is displayed in a first portion 122 of the screen of Fig. 10A. It 
includes the temporary title and creation date 128. A "Content" column 130 shows the default 
format that initially includes the volume, table of contents, preface and index. A price column 
132 reflects the price of the compilation calculated thus far, as is determined by prices associated 
10 with the elements of column 130. Similarly, a "Pgs" column 134 reflects a page count calculated 
so far, as determined by page estimates for the elements of column 130. 
m The user may elect to return to the Welcome page by clicking on "CBO" button 1 19. 

i; 0 Section 124 of the screen is provided to allow the user to add new chapters to the CBO. This 
CO function will be described in more detail subsequently. Section 126 enables the user to change 
^Jl5 any container (book, volume, chapter) or noncontainer (section) title by selecting the entity type 
G via pull-down menu 125 and inserting the new title in entry field 127, then clicking on "Change" 
s: button 129. In the present example, let us assume that the user wishes to change the book title to 
tl "Internet Basics". Entity type "book" has been selected in menu 125 and the desired title has 
fU been inserted in field 127. By clicking on button 129, the ECBOSetTitle procedure is invoked, 
£320 as described below. 

ECBOSetTitle 

Description 

25 This modifies the book and container/chapter titles of the active CBO to whatever the user 

specifies. 

Functional Processing 

1 . Verify that the CBO id passed is valid. 

30 2. Verify that the target sequence id passed exists in the CBO. Otherwise, the target becomes the CBO itself. 

3. Set the title of the target element within the CBO. 

4. When renaming, determine if the title is protected and cannot be changed 
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5. When renaming, the title text is validated by Javascript on the web page 

6. When renaming, the "<" and ">" symbols will be changed to HTML equivalents: "&lf ' ; "&gf 

Parameter List 



Inputs 


DSUsername 




DSPassword 




DSHostname 




Userid 




CBOid 




Target id 




Title 


Outputs 


New CBO description 


Errors 


RC_SYSTEM_ERROR 



Index Classes 

• CustomBookOutline 

Fig. 10B shows the CBO with its new title. Let us assume that the user now wishes to 
add preexisting content to the CBO. 

Adding Prepublished Content to a CBO 

Them are two methods for selecting preexisting content viewing a "bookshelf ' of content available to the 
user, or searching the stared content Searching shall be described in more detail subsequently. 

To access a bookshelf, the user clicks on the 'library" tab 136 on the vertical menu bar. This action 
invokes aBrowseBookshelf procedure to retrieve all or a subset of the piepublished content stoned in the digital 
library 20. For convenience, a subset or ''bookshelf may be tailored to the particular user. In the present example, 
content is grouped into academic programs such as Engineering, Mathematics, English, etc. Each program is 
assigned a unique ProgramID, and each program entity 5 s ProgramID is stored in the Product Index class defined by 
the Product Entity Index Classes. Useis arc assigned to a program based upon department information entered at the 
time of system registration. 

An exemplary bookshelf tailored to an Engineering program is shown in the user-interface screen of Fig. 
11. It includes a list of prepublished books whose titles arc hypertext links to their corresponding PSF files. By 
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clicking on one of these titles 138, the user invokes the EProductGetOutline procedure call which retrieves the PSF 
file, parses it, and display it to the user as shown in 11 The EProductGetOutline function is described below: 

EProductGetOutline 

Description 

This returns attribute information about an existing product stored in the digital library. It allows 
calling applications to retrieve the only the attributes that it needs or all attributes. It will return 
attributes for all entities within the product hierarchy, exposing the product outline. Sysadmin 
prepubcall when book becomes public. Returns entire outline/browse of bookshelf 

Functional Processing 

I. Search for any existing prepublished content product with the product id passed. 

II. Verify that there is exactly one match for the product id. 

III. If the attribute list is not null, only those attributes in the list are of interest. 

A. Retrieve the requested attributes along with the various levels of entities in the existing copyright 
product 

B. If an attribute of interest is not found for the existing copyright product, an attribute value of null is 
assumed. 

IV. If the attribute list is null, a// attributes are of interest. 

A. Retrieve all attributes along with the various levels of entities in the existing copyright product. 

B. Retrieve the program-specific attributes related to pricing and virtual page count. 

C. Calculate and update the pricing and virtual page count information for each level of entities in the 
existing copyright product. 

V. Export the attributes and entities description into the output buffer. 
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Parameter List 



Inputs 


DSUsername 
DSPassword 
DSHostname 
Product id 
Attribute list 
Distribution country 
Monetary unit 
Composition set 


Outputs 


Product description 


Errors 


RC_SYSTEM_ERROR 



5 Index Classes 



ifi • Product 
m • ProductAux 



^ • Program 
IfSlO • ProgramAux 



71 The selected bookis displayed by cha^^ A chapter rrmy be expanded to 

^ y view its sections by clickingon its corresponding expand button 141. In Rg. 12, chapters 1 and7havebeen 
Q expanded. Each chapter and section is provided with a select box 145. The user selects content for inclusion in his 
" S 15 castom book by checking the boxes next to the chapters and sections he desires. Si the example of Fig. 12, Chapters 
1 146 and 7 148 have been selected According to the present embodiment, a user can not add chapters and 
sections at the same time. However, it should be apparent to the user that this implementation is merely a matter of 
design choice, ant that if desired, chapters and sections could be added at the same time. 

Once selection is complete, the user clicks on the 6 'Add toBook"button 144 on the vertical menu bar. This 
20 action adds the selected text to the CBO by invoking the ECBOAddContent procedure described below. 
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ECBOAddContent 



Description 

There are two types of content: (chapters and sections) also referred to as containers and 
non-containers that can be added using this function. ECBOAddContent defines separate 
5 processes depending on the content type being added. 

This function adds pre-published content into the CBO once the user has selected which content he 
wants to include in his CBO. The function verifies rights and pre-requisite content before the 
content is included. Additional volumes are constructed on an as needed basis. 

Functional Processing 

10 1 . Verify that the CBO id passed is valid. 

2. Check the country code in the CBO. 

3. Verify that the CBO chapter location reference is valid if specified. 

4. Verify that each piece of content does not already exist in the CBO. 

5. For content containers (e.g. Chapter): 

Ol 5 a) Retrieve the piece of content by its content id (sequence number), 

ifi b) Verify that the piece of content is indeed a container. 

E c) Validate the right to distribute it . 

d) Validate the right to combine it with other contents in the CBO. 

e) If the CBO chapter location reference is valid, add the content container after the chapter 
(f20 location reference. 

% f) If the CBO chapter location reference is null or invalid, add the piece of content to the end of 

the CBO. This situation occurs when a CBO contains no chapters (a.k.a. empty). 



: , 6. For content components (e.g. Sections): 

! p25 a) Retrieve the piece of content by sequence id. 

N :: b) Verify that the piece of content is indeed a non-container. 

Ill c) Validate the right to distribute it. 

y : d) Validate the right to combine it with other contents in the CBO. 

g e) If the CBO chapter location reference is valid, add the piece of content to the end of the chapter 

S30 location referenced, immediately before any back matter elements in that chapter. 

f) If the CBO chapter location reference is null or invalid, return error. 

7. For content components (e.g. Section), process prerequisites listed one at a time: 
a) Retrieve the piece of prerequisite content by sequence id. 
35 b) Validate the right to distribute it. 

c) Validate the right to combine it with other contents in the CBO. 

d) If the piece of pre-requisite content does not already exist in the CBO, add it to the end of the 
chapter location referenced, immediately before any back matter elements. Mark it as 
pre-requisite, so it cannot be deleted later. 

40 e) If the piece of pre-requisite content already exists in the CBO, but it has never been 

pre-requisite before, downgrade it to the pre-requisite status so it cannot be deleted later. 
Location of the piece of pre-requisite content remains unchanged, 
f) If the piece of pre-requisite content already exists in the CBO, and is already in pre-requisite 
status, update the pre-requisite information with the current requiring content id. Location of the 

45 piece of pre-requisite content remains unchanged. 

9. Perform volume management (described subsequently). 

10. Perform pricing and page count calculations (described subsequently). 

50 Parameter List 
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inputs 


uo use marne 




LJo r ass wo ru 




n Q 1-1 rtotn a m a 

uonosuicuii© 




Usend 




CBO id 




oontent la list 




Content id count 




fVintflinpr flan 




CBO chanter location reference 


Oi itni itc 


Mow PRO HpQPrintinn if OK' 






Errors 


nr> mat cr\\ tMn 




DP HI IPI IPATC MOT AI 1 OWFPt 




nr> PHMTCMT MOT fHAPTFP 




KU_UDU_NU_VL/LUIvlt 




HOJ N V ALI U_UtJ\J 




RC_BAD„CHAPTER 




RC_BAD_CONTENT 




RC_1MVALID_PREREQ 




RC_NOJ)IST_RIGHTS 




RC_NOT_COMB_RIGHTS 



^ Index Classes 

m 5 • CustomBookOutline 
^ • Rights 
I ^ • RightsAux 
O • Prereq 

• PrereqAux 
10 • Product 

Steps 6.d) and 7.c) above are performed to prevent mutually exclusive content entities stored in 
the digital library from being included the same compilation of content. The substeps necessary to 
perform this mutual exclusion are defined subsequently. When a content entity is added to a CBO, the 
Product Index class is referenced to determine if the content entity is mutually exclusive of any other 
1 5 content entities. If it is, then the CBO is checked to determine if any of the mutually exclusive content 
entities already exists in the compilation. If not, the content entity is added to the compilation, but if a 
mutually exclusive content entity is present, the add fails and the user is given an error message. 

After the selected content is added to the CBO, the user is returned to the CBO screen and is 
able to view his new additions (see Fig. 13 A). In the present example, the ECBOAddContent 
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procedure has retrieved and inserted the titles of all of the sections 150 of selected chapter 1 146 and the 
selected sections 152 of Chapter 7 148, and has assigned new chapter and section numbers accordingly. 
It has also retrieved and inserted references to front and back matter 154 associated with the selected 
content (e.g., introductory text, summary text and sample problems). Selection boxes 157 are provided 
next to each chapter title and section title to facilitate further editing of the CBO. 

Let us now assume that the user wishes to add his own content to the CBO. 

Adding User-Provided Content to a CBO 

User Provided Material (UPM) is content an instructor contributes to a CBO that is not part of 
the pre-published content. UPM consists of two components: title and content. The title of the UPM is 
inserted into the CBO's index class of contents and added to the content outline. The content is stored 
as a part in digital library 20. UPM can be added at either the chapter or section level. In other words, 
UPM can be siblings of chapters or sections. 

The user first selects a location at which the UPM content is to be inserted using 
pull-down menu 155 (see Fig. 13). In the present example, the user may select to add the UPM 
to one of the existing chapters or to a new, untitled chapter. New chapter is selected in this case 
(not shown). The user then clicks on the "Add Content" button 156. This action invokes a UPM 
procedure which displays a "My Custom Content" screen as shown in Fig. 14. The user enters a 
title in title field 158, and the UPM content in entry window 160. The content may be designated 
as standard text or as a code sample. Once the content has been entered, the user clicks on button 
1 62 to add the content to the CBO. This action invokes the ECBOAddUPM procedure described 
below. In the present example, ECBOSetTerms procedure is also invoked the first time the user 
creates UPM. 
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ECBOSetTerms 



Description 

The first time an UPM is created in the active CBO, the user is first asked to read the UPM terms 
and conditions (e.g., copyright ownership, etc.). If the user accepts them, the dateAime stamp of 
5 the acceptance is recorded with the CBO. 



Functional Processing 



I. Verify that the CBO id passed is valid. 

II. Set the UPM terms and conditions date/time stamp to the time string passed in the active CBO. 



1 0 Parameter List 



Inputs 


DSUsername 
DSPassword 
DSHostname 
Userid 
CBO id 
Time string 


Outputs 




Errors 


RC_SYSTEM_ERROR 



H Index Classes 

I™ • E CustomBookOutline 



ECBOAddUPM 

Description 

20 This function adds the UPM that a user has entered into the web application into the CBO. 

ECBOAddUPM validates the designated insert location and stores the UPM with its associated 
location attributes in the digital library. The UPM title and size are stored within the CBO part. The 
UPM text is stored in a separate part The UPM is stored as a part whose number is determined by 
a UPM counter initialized to 50. This function performs the following tasks. 



25 Functional Processing 

1 . Verify that the CBO id passed is valid. 

2. Verify that the container id passed is valid, if specified. 

3. Verify that the UPM id passed is valid, if specified. 

4. If the UPM id passed is null, this is a UPM add operation: 

30 a) Create a new UPM with the passed UPM title and text blob as the content. The UPM Title is 

placed into the CBO part and the content is placed into a new UPM part having a unique 
identifier defined by the attribute, SJ_astUpm Parti D which is an incremental counter. 
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S_LastUpmPartlD, resides in the E_CustomBookOutline index class and contains the UPM id 
used in the creation process. Once the UPM is stored within DL, this attribute is incremented. 

b) If the container id passed is null or invalid, insert the UPM at the end of the last chapter in the 
CBO, before any back matter elements in the chapter. 

c) If the container id passed is valid, insert the UPM at the end of the container referenced, before 
any back matter elements in the container. 

d) Note that an UPM add operation is always in the context of a container. The UPM can be 
moved outside the container as needed. 

5. If the UPM id passed is not null, this is an UPM update operation. 

a) Verify that the UPM does exist in the CBO. 

b) Update the content of the UPM with the UPM text blob passed. 

c) The location of the UPM remains unchanged. 

6. Set the UPM Title attribute in the CBO to the title passed. 

7. Volume size management is performed. If an add or update causes the total virtual page count of 
the current volume to go above or below the volume virtual page count limit, the volumes are 
reorganized. 

8. Pricing and virtual page count calculations are performed for each add or update operation. The 
net results are stored as attributes for the affected Chapter, Volumes as well as the CBO totals. 



Parameter List 



Inputs 


DSUsemame 




DSPassword 




DSHostname 




Userid 




CBO id 




Container id 




Title 




UPM id 




UPM text blob 


Outputs 


New CBO description 


Errors 


RC_NOT_FOUND 




RCJNVALIDCBO 




RC_SYSTEMJERROR 



After the UPM has been added to the CBO, the user is returned to the CBO screen where 
he can view the newly added material 164 (see Fig. 15 A). 

User-provided material may be edited by clicking on the UPM title, which is a hypertext 
link to the UPM part. This action invokes the ECBOGetUPM procedure to retrieves the UPM 
part and display it in editing window 160 of the "My Custom Content" screen (Fig. 14). 
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ECBOGetUPM 

Description 

This will return the UPM content of an UPM section, including all of the UPM elements. 

5 Functional Processing 

I. Verify that the CBO id passed is valid. 

II. Verify that the UPM id passed is valid. 

III. Retrieve the content of the UPM. 

10 A. If there is one single element in the UPM, retrieve its content into a text buffer. 

B. If there are multiple elements in the UPM, retrieve all their contents in one single text buffer, in the 
^ order they appear in the UPM. 

5i F Parameter List 



hi 5 



Inputs 


DSUsername 

DSPassword 

DSHostname 

Userid 

CBOid 

UPM id 


Outputs 


UPM text blob 


Errors 


RC_SYSTEMJ3RROR 



Index Classes 

• CustomBookOutline 

20 Once the user has entered his desired changes, he clicks on button 162 and once again invokes 
the ECBOAddUPM procedure. This time, the procedure recognizes the UPM as an update and 
follows the alternate processing path described above. 

Adding New Containers to the CBO 

25 

Fig. 15B represents the lower portion of the "My Book Outline" screen, including the 
"Add" and "Change" sections 124, 126 previously described. We will now assume that the user 
wishes to add a custom container (i.e., chapter) to the CBO in Fig. 15 A. To do so, he enters a 
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desired container title in field 121 and clicks on the "Add" button 124. This action invokes the 
ECBONewContainer procedure described below. 

ECBONewContainer 

5 Description 

This function will add a new container (e.g. chapter) into the user's active CBO. It does not add 
content to the container. 

Functional Processing 

10 

I. Verify that the CBO id passed is valid. 

II. Verify that the container type passed is valid. 

!:|| III. Create a new empty container at the end of the CBO. 

Ql IV. Set the title attribute to the container title if specified; otherwise set the title to the default, 
il 5 Parameter List 



Inputs 


DSUsername 




DSPassword 




DSHostname 




Userid 




CBO id 




Container type 




Container title 




Output flag 


Outputs 


New CBO description if output flag 0; 




New container id if output flag 1 


Errors 


RC_NOT_FOUND 




RC_INVALID_CONTAINER_TYPE 



Index Classes 

20 

• CustomBookOutline 

The new container 165 is added to the CBO as shown in Fig. 16. We will now assume 
that the user wishes to rearrange the entities in his CBO. 
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! 

Moving and Relocating Content 

There are two separate processes involving the movement of content within a CBO. These 
processes are: moving and relocating. Moving content involves some piece of content moving up or 
down by one item within a CBO. Relocation involves moving a piece of content to some other location 
5 within the CBO. Sections, UPM and chapters are the only entities permitted to be moved within a 
CBO. For a movement or relocation task to complete without error, the following rules must be 
obeyed. 

1. A section must reside within a chapter 

2. A chapter must reside within a volume 

s^l 0 3 . UPM must reside within either a chapter or a volume 
+; 4. A chapter/UPM is not permitted before a volume's front matter 
yl 5. A chaptei/UPM is not permitted after a volume's back matter 
" f s 6. A section is not permitted before a chapter' s front matter 
^ 7. A section is not permitted after a chapter's back matter 

Q 15 8. All content must move/relocate without error, or none of the content is moved/relocated. 

Content is moved within a CBO via the "My Book Outline" display screen (see Fig. 16). A 
user selects the content to be moved by checking its corresponding select box. In the present example, 
section 1.3 has been selected 166. He then specifies a target location using drop down box 168. Within 
the box the user may select to move an entity up or down (by one of the same entity), select an existing 

20 chapter from a current list of chapter titles, or create a new untitled chapter. In the present example, the 
user has selected to move section 1 .3 to existing Chapter 4. After selection, the user clicks on the 
"Move Items" button 170. 

If the user selects to move content up or down, the ECBOMoveContent function is invoked. If 
he selects to move content to a different chapter or a new chapter, the ECBORelocateContent function 

25 is invoked. Both functions are described below. Both procedures cause page and price totals to be 
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recalculated and stored in digital library via the EAdminPopulatePageAndPrice procedure, also 
described below. 



ECBOMoveContent 

Description 

This function will move content up and down within the CBO per the instructors' direction. This 
function permits moving one or more pieces of similar content in the same call. Similar content is 
defined to be either all sections and UPM's or all chapters, but not hybrids of the two. It will make 
sure that all rules are adhered to and will also request Volume Size Management when necessary. 

Functional Processing 

1 . Verify that the CBO id passed is valid. 

2. Verify that selected content is either all chapter containers or all sections, but not a mixture of both. 

3. Contents listed are moved one at a time. For moving up, the contents are processed in the order listed. For 
moving down, the contents are processed in the reverse order. 

4. Verify that the piece of content does exist in the CBO. 

5. Verify that the piece of content is not a front or back matter element. Front or back matter elements can only 
be moved implicitly when an entire chapter is moved. 

6. When moving containers down (e.g. Chapter): 

a) Verify that after this piece of container content in the CBO, there is at least one more piece of content at 
the same level (e.g. Chapter, UPM Section outside a chapter). 

b) Move the piece of container content to the position after that next piece of content. 

7. When moving content components Sections down: 

a) If it is the last section besides any back matter elements in a chapter, verify that chapter is not the last 
chapter in the CBO. 

b) Move the piece of content down by one position, but skip over any front or back matter elements. If 
leaving a chapter, it is moved to the beginning of the next chapter. 

8. When moving content component UPM Sections down : 

a) If it is outside chapters, verify that it is not the very last element in the CBO. 

b) Move the piece of content down by one position, but skip over any front or back matter elements. If 
leaving a chapter, move it to the position immediately after the chapter thus at the same level as a 
chapter. If entering a chapter, move it after any lead front matter in the chapter. 

9. When moving containers up : 

a) Verify that before this piece of container content in the CBO, there is at least one more piece of content 
prior to it at the same level (e.g. Chapter, UPM Section outside a chapter). 

b) Move the piece of container content to the position before that previous piece of content. 

10. When moving content components up: 

a) If it is the first section besides any front matter elements in a chapter, verify that chapter is not the first 
chapter in the CBO. 

b) Move the piece of content up by one position, but skip over any front or back matter elements. If leaving 
a chapter, it is moved to the end of the previous chapter. 

11. When moving content component UPM Sections up : 

a) If it is outside chapters, verify that it is not the very first element in the CBO. 

b) Move the piece of content up by one position, but skip over any front or back matter elements. If leaving 
a chapter, it is moved to the position immediately before the chapter thus at the same level as a chapter. 
If entering a chapter, move it before any trailing back matter in the chapter. 

If a move causes the total virtual page count of the current volume to go above or below the volume 
virtual page count limit, the volumes are reorganized. 

12. Pricing and virtual page count calculations (described subsequently) are performed for each move operation 
and stored in digital library via the EAdminPopulatePageAndPrice procedure. The net results are stored as 
attributes for the affected Chapters, Volumes as well as the CBO totals. 
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Parameter List 



Inputs 


DSUsemame 




uo r assworu 




DSHostname 




Userid 




CBO id 




Content id list 




Content id count 




Container flag 




Move flag 


Outputs 


New CBO description if OK; 




Offending content ids if error 


Errors 


RC_NOT„FOUND 




RC_INVALID_CBO 




RCJNVALlDJvlOVE 



Index Classes 

• CustomBookOutline 



ECBORetocateContent 
Description 

This will relocate content to a specific location within the same CBO that was specified by the 
instructor. It is different than a move, which moves content one position at a time, yet it adheres to 
the same rules and has much of the same processing. 

Functional Processing 

1 . Verify that the CBO id passed is valid. 

2. Verify that the destination container id passed exists in the CBO. 

3. Contents listed are relocated one at a time in the order provided. 

4. Verify that the piece of content does exist in the CBO. 

5. For content containers (e.g. Chapter), add it after the container specified. 

6. For content components (e.g. Section, UPM Section), add it to the end of the container specified, before any 
back matter elements in that container. 

If a relocate causes the total virtual page count of the current volume to go above or below the 
volume virtual page count limit, the volumes are reorganized. 

7. Pricing and virtual page count calculations (described subsequently) are performed for each relocate 
operation. The net results are stored in the digital library via the EAdminPopulatePageAndPrice function as 
attributes for the affected Chapters, Volumes as well as the CBO totals. 
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Parameter List 



Inputs 


DSUsername 




DS Password 




ub Hostname 




i jcpiiH 




ORO id 




Content id list 




Content id count 




Container id 


Outputs 


New CBO description if OK; 




Offending content ids if error 


Errors 


RC_NOT_FOUND 




RCJNVALIDCBO 



Index Classes 

• CustomBookOutline 



EAdminPopulatePageAndPrice 

Description 

Calculate and update the pricing and virtual page count information of either an existing 
prepublished content product or a CBO. For prepublished content products, this function is called 
at load time. For CBO's, it is called every time a CBO is modified in a manner affects the CBO 
page count or price. 

Functional Processing 

I. Import content of the input filename, which is description of either one existing prepublished content product 
or one CBO. 

II. Retrieve the program-specific attributes related to pricing and virtual page count. 

III. Calculate and update the pricing and virtual page count information for each level of entities in the 
prepublished content product or CBO description. 

IV. Export the updated description as content of the output filename. 
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Parameter List 



Inputs 


DSUsername 
DSPassword 
DSHostname 
Input filename 
Output filename 


Outputs 


(Content of the output file 
named) 


Errors 


RC SYSTEM ERROR 



5 Index Classes 

• Program 
M • ProgramAux 

^10 In the present example, "General Netiquette" section L3 166 is relocated to Chapter 4 by 

CP the ECBORelocateContent procedure, as shown in Fig. 17. Affected sections have been 
m renumbered. 



Deleting Content from a CBO 

1:15 Content may be deleted from the custom book outline by checking the select box next to 

r. 

O the entity to be deleted and then clicking on the "Remove" button 172 on the vertical menu bar 
(see Fig. 16). This action invokes the ECBORemoveContent procedure, described below. 



ECBORemoveContent 

20 Description 

This will remove content from an active CBO. It After the instructor selects sections / chapters in 
his active CBO, he may remove them from the CBO. It also will ensure that no content is removed 
that is a prerequisite to another. It will merge content back into a single volume if possible. 



25 



Functional Processing 

1 . Verify that the CBO id passed is valid. 

2. Content comes in two categories, containers and non-containers. Selected content must be either all 
containers or all content components, but not a mixture of both. 
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3. Contents listed are removed one at a time from the active CBO. 

4. Verify that the piece of content does exist in the CBO. 

5. Verify that the piece of content is not a pre-requisite of any contents in the CBO, other than those being 
removed this time. 



6. Remove the piece of content from the CBO. 

7. If a remove causes the total virtual page count of the current volume to go below the volume virtual page 
count limit, check whether it is possible to move some or all its content elements to the previous volume. If 
after the adjustment the current volume becomes empty, the current volume is deleted. If subsequent 
removes cause the total virtual page count of the current volume to go below the volume virtual page count 
limit, further adjustments are made as needed. 

8. Pricing and virtual page count calculations are performed for each remove operation. The net results are 
stored as attributes for the affected Chapter, Volumes as well as the CBO totals. 



Parameter List 



Inputs 


DSUsername 
DSPassword 
DSHostname 
Userid 
CBOid 
Container flag 
Content id list 
Content id count 


Outpu 

ts 


New CBO description if OK; 
Offending content ids if error 


Errors 


RC__NOT_FOUND 

RC_PREREQ_ERROR 

RC_INVALID_CBO 



Index Classes 



• Custom BookOut line 

Certain entities can be protected from deletion. In the present example, these include 
automatically generated components such as the Table of Contents, Preface, and Index. They 
also include front and back matter associated with the content sections inserted by the user. 
Consequently, these entities are not provided with select boxes. 
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A user may clear all contents of an active CBO by clicking on the "Clear Book" button 
178. This action invokes a simplified version of the previous function, which essentially selects 
all entities, including front and back matter, UPM's and titles, and removes them from the CBO. 
The actual implementation is a call to the ECBODelete function, followed by a call to the 
ECBOCreate function. The user is left with an empty CBO shell like that of Fig. 10A. 

ECBODelete 

Description 

This function discards all contents of an active CBO. 

Functional Processing 

1 . Verify that the CBO id passed is valid. 

2. Remove the CBO entirely from the datastore. 



Parameter List 



Inputs 


DSUsername 

DSPassword 

DSHostname 

Userid 

CBOid 


Outputs 




Errors 


RC_SYSTEM_ERROR 



Index Classes 

• CustomBookOutline 

Undoing Changes to CBO 

With some exceptions, a user may undo the last change to the content of his active CBO. 
Referring to Fig. 17, this is accomplished by clicking on the "Undo Last" button, which invokes 
the ECBOUndo procedure described below. 
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ECBOUndo 

Description 

This function undoes the last change to the content of the active CBO. Note that this is a single 
level undo capability. 

For each CBO, two versions of the CBO content are maintained as separate parts in the digital 
library. As each CBO operation is performed, if the operation will change the content of the CBO, 
the active version before the change is saved as the backup version. And the new active version 
will include the change. Later on when the undo operation is required, the backup version is 
restored into the active one. 

Functional Processing 

I. Verify that the CBO id passed is valid. 

II. Verify that the active version of the CBO is available. 

III. Verify that the backup version of the CBO is available. 

IV. Set the Undoable status to False in the backup version. 

V. Mark the backup version as the new active version of the CBO. 

VI. Remove the old obsolete active version of the CBO. 

VII. The following CBO operations can be undone: 



A. 


ECBOAddContent 


B. 


ECBONewContainer 


C. 


ECBORemoveContent 


D. 


ECBOMoveContent 


E. 


ECBORelocateContent 


F. 


ECBOSetTitle 


G. 


ECBOSetState 


H. 


ECBOSetTerms 



VIII. The following CBO operations cannot be undone: 

A. ECBOCreate 

B. ECBODelete 
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C. ECBOUndo 

D. ECBOAddUPM 

E. ECBOSubmit 

IX. The following CBO operations have no impact on the undoable status since there was no change to the 
CBO: 

A. ECBOGet 

B. ECBOList 

C. ECBOListByCreator 

D. ECBOGetUPM 

Parameter List 



Inputs 


DSUsername 
DSPassword 
DSHostname 
Userid 
CBO id 


Outputs 


New CBO description 


Errors 


RC_CBO_NOT_UNDOABLE 
RC_SYSTEM_ERROR 



Index Classes 

• CustomBookOutline 

Editing an Existing CBO 

Returning users may already have CBO's stored in the digital library 20 that were created 
in previous sessions. Thus at logon, the logon procedure invokes the ECBOListByCreator 
procedure, which issues a query to the digital library 20 to find any CBO entries in which the 
E_CreatorID column matches the userid. The ECBOListByCreator function is described below. 
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ECBOListByCreator 

Description 

This will return a list of CBO's created by a particular user within a Program ID. 

Functional Processing 

1 . Verify that program id passed is valid. 

2. Search for CBO's with the program id and the creator id specified. 

3. For each CBO found, output the CBO id, Type, Title, Creator id, Modification timestamp, and CBO status. 
Parameter List 



Inputs 


DSUsername 
DSPassword 
DSHostname 
Program id 
Creator id 


Outputs 


CBO id list 


Errors 


RC_INVALID_OR_MISSING_INPUT 
RC_SYSTEM_ERROR 



Index Classes 

• CustomBookOutline 

• Program 

Fig. 9B is an example of a welcome screen that may be presented to a returning user by 
the ECBO ListByCreator procedure. The Custom Book Outlines section 113 lists CBO's that the 
user created in previous sessions. 

The user now has three options. He may 1) select an active CBO for modification by 
clicking on its title, 2) "CLONE" and modify an existing CBO by clicking on the CBO's 
corresponding CLONE button 1 16, or 3) create a new CBO by clicking on button 118. 

To modify an existing CBO, the user clicks on its title. This action invokes the EDIT 
procedure, which retrieves the corresponding CBO and associated parts from digital library 20 
(but not the actual content). The CBO is parsed and displayed to the user in the same format 
previously described with reference to editing a new CBO. In Ms case, however, there will 
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already be entities present in the outline. The user may now add, move, or delete content as 
previously described. 

The user may wish to create a new version of a CBO while preserving the original CBO 
This can be achieved by invoking a "CLONE" function. In brief, the "CLONE" function makes 
5 a duplicate copy of an existing CBO which the user may then modify as desired. The user merely 
clicks on the "CLONE" button 116 corresponding to the CBO to be copied. This action invokes 
the ECBOClone procedure described below. 



ass 



ECBOClone 

10 Description 

This function will copy an existing CBO to a new CBO for a user. It copies all UPM and 
pre-published material contained within the CBO. Users clone CBOs when then they: 



I. Click on the Clone button within the Login page 
=y II. Click on the Clone button within the MyBookOutline page 



13 1 5 Functional Processing 

I. Copy the existing CBO to a new CBO with user's userid and title specified, 
[i II. Mark the new CBO's status as Working (CBO Status = "1 "). 



! 5 = 



y . III. Extract the Program variables related to pricing and virtual page count, storing them in the CBO. 

J^20 The estimated page count and prices from the original CBO are kept on the new CBO. 

IV. Writes the CBO to the digital library 



Parameter List 



Inputs 


DSUsername 
DSPassword 
DSHostname 
Program id 
Userid 

Existing CBO id 
New CBO Title 


Outputs 


New CBO id 


Errors 


RC_INVALID_OR_MISSING_INP 
UT 

RC_SYSTEM_ERROR 
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Index Classes 

• CustomBookOutline 

• Program 

• ProgramAux 

Viewing a CBO 

When the user has completed creating his custom book outline, he may view its contents 
by clicking on "Submit Request" button 180 on the vertical menu bar (see Fig. 16). This action 
invokes View procedure in application layer 28. Briefly, the view procedure displays the custom 
book in a manner as shown in Fig. 18, i.e., in a table of contents format wherein the section titles 
are hypertext links 190 to files containing the actual section content files retrieved. By clicking 
on any section title, the user invokes the EGetHTML procedure. In brief, the EGetHTML 
procedure retrieves the part containing the SGML contents of the selected section and passes 
them to an SGML to HTML converter. An example of an SGML to HTML converter suitable 
for use according to the present embodiment is the publicly available Omnimark Konstructor. 
The SGML to HTML converter performs the conversion using a data type dictionary, which 
defines the SGML content. The EGetHTML procedure is described in more detail below. 

EGetHTML 

Description 

This function retrieves the SGML content for a particular sequence id and then passes the content 
blob to an SGML to HTML converter (e.g., Konstructor). Search words are also passed so that the 
converter can highlight found words. It communicates over a TCP socket to Konstructor. 

The low resolution associated components for ail sections are stored on the front-end web server 
for fast retrieval on presentation. Therefore, the content does not have to make a digital library call 
for each image, speeding up retrieval. 

Functional Processing 

I. Construct the header that contains tags for the sequence id, title hit words and index hit words. 
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II. Search for a product entity with the sequence id passed. 

III. Retrieve the SGML part of this entity. 

IV. Connect to the Konstructor server using the SGML hostname / IP address and port number passed. 

V. Send the header information that contains tags for the sequence id, title hit words and index hit words. 

VI. Send the SGML data for translation. 

VII. Receive from the Konstructor server the HTML blob resulted from the translation. 

VIII. Disconnect from the Konstructor server. 

Parameter List 





Inputs 


DSUsername 






DSPassword 


■ ifV 




DSHostname 






Userid 






SGML hostname or IP 






address 


| 




SGML port number 






Sequence id 


Hj 'i 




Title hit words 






Index hit words 




Outputs 


HTML blob 




Errors 


RC_SYSTEM_ERROR 


trio 




-1 



Index Classes 
• Product 



15 
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4. Page and Price Calculation 

The Page and Price calculations are performed by EProductGetOutline and the CBO routines 
using a set of algorithms against data that is stored in the digital library in both the Program and Section 
entities. This estimated page count and price can be overridden with actual page counts from the 
original pre-published sections by system administrators. 
Inputs 





CBO 


UPMBYTES 


Derived number of bvtes of UPM in the CBO 




E_Program 


AvgChrPerlmage 


Average characters per image 






AvgChrPerSGMUVC 


Average characters per SGML Associated Component 






MaxChrPerUpmTier 


Number of characters in an UPM pricing block 




E_ProgramAux 


PE_Base_Cust_Pub_Price 


Base price to be added to every custom publication 






PE_Base_UPM_Fee 


Base price to be added whenever UPM is included 






PE_lncr_UPM_Fee 


Additional price for each UPM pricing block 


sTr: 




PE_Source_Price_Per__Page 


Price per page for pre-published content included 






PE_UPM_Bytes_Per_Page 


Number of UPM characters in a page 






PEJvlinimum_PageJJmit 


The minimum number of pages required in a custom publ. 






PE_Volume_PageJ_i mit 


The maximum number of pages in a volume 






PE_AC_FormJD 


The associated component type defined in . 


= Is 




PE_AC_ByteCount 


Number of "characters" for that associated component type 




E_Product 


lmage_Type 


Type of image, as defined in 






ImageType 


Number of non-markup characters in content 






ACJmageCount 


Number of associated component images in content 






ACSubdocCount 


Number of SGML subdocuments in content 






PageCount 


Actual page count of content 




E ProductAux 


AC_Counts.ACFORMID 


The associated component type defined in . 






ACCounts. NUMBERAC 


The number for each associated component type 






Use_Actuais 


Switch to determine if content actual page count is used 



10 Outputs 

None 

It is desirable to construct a page and price calculator for the purpose of estimating page and 
pricing information for a CBO. The following page and price calculator described below was devised 
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for this purpose. Each time the ECBOGet function is called to retrieve a CBO for displaying, the Page 
and Price calculation routines are run for the CBO. 

Actual Page and Price calculations occur several times throughout the life of a CBO, e.g., Page 
and Price calculations for Chapters, Volumes and the entire CBO occur any time there is an operation 
5 performed on that CBO which includes adding, moving, or deleting content. For pricing consistency 
the Page and Price calculation for a pre-published section in the present embodiment occurs one and 
only one time for the entire life of that content inside the CBO. However, it will be understood that a 
system designer can alternatively decide to permit changes to prepublished sections if desired. 

The price of a CBO is dependent on the page count. Therefore, the page count is calculated 
10 first. According to the present embodiment, the Chapter Page Count calculation is the aggregation of 
the Section and UPM Page Counts for any content that exists in that chapter. The Chapter Price 
k§ calculation is the aggregation of only the Section Prices for any sections that exist in that chapter. 
m Calculation of the Pages for volumes is the aggregation of the Chapter and UPM Page Counts for all 
2f chapters in that volume and UPM that exist as a direct child of that Volume. The Volume Price 
Ql 5 calculation is the aggregation of only the Chapter Prices for any Chapters that exist in that volume. 
" ? 5 Calculation of the Page Counts for the entire CBO is the aggregation of all of the Volume Page Counts. 
jUi pri ce calculations for the CBO are done with the Volume Page Counts, Total UPM Page Count, 
III Volume Prices, and the specific program variables as inputs into the CBO price calculation formula, 
f h This embodiment implies the following page count calculation order: 

1 . Section/UPM Page Count 

2. Chapter/UPM Page Count 

3. Volume Page Count 
4 Book Page Count 

25 

A section's page counts are estimated based upon the number of non-tag characters in the 
content, plus the number and type of associated components (AC). ACs are graphics appearing in the 
section and cannot exist outside of the section. In an effort to make the page estimating process more 
accurate, ACs are broken up into five different categories based on size. The categories below are 
30 differentiated and their values stored in the Program_Aux index class and their values stored in 
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PE_AC_ByteCount. The E_ProductAux index class will then contain matching ACFORMIDs and 
NUMBERAC values to use in the calculation. Page counts will be rounded to the nearest tenth of a 
page. 

1Associated Component Categories 



1 


Inline Graphic 


2 


Display Graphic 


3 


Inline Equation 


4 


Display Equation 


5 


SGML 



Li some cases, the automated page/price calculator does not prove to be accurate to the original 
pre-published material's figures. Therefore, a Use_Actuals option is provided within the calculator. 
This option provides the system administrator with the capability to override a calculated value for the 
page and/or price values for a pre-published content. This Use_Actuals capability forms the beginning 
of the calculator process. 

Page and Price Count Calculator Process 

1 . When Use_Actuals is " 1 " and PageCount has a value, then use PageCount 

2. When Use_Actuals is "0" or PageCount has no value, then use the calculation below. 

a) First, calculate the number of "characters" in the content, using textual SGML, 

images, graphics, and subdocument SGML. These values are preferably calculated by 
Converter 10 and are loaded by the Eloader program. The calculator then multiplies the number 
of each category of associated component by the average byte count for each category and then 
adds the number of source characters (CharjCount ) in the SGML. Char_Count is stored 
within the Product Index Class. 



Characters = (E_Char_Count + ^nwci^w^ 



Equation 2. Characters per page calculation 
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b) Second, divide the number of "characters" by the average number of "characters" 

per page (AvgCharPerPage) to get the estimated page count. The AvgCharPerPage variable is 
stored within the Program index class and placed within the CBO at CBO create time. 

Pages = Characters / E_AvgChrPerPage 
Equation 3. Estimated Pre-published Pages Calculation 

3. Calculate the UPM Estimated Page Count by dividing the number of UPM characters (UPMBytes) 
by the number of UPM characters per page (PE_UPM_Bytes_Per_Page). The UPMBytes is the UPM 
file size and is computed when the UPM is added to the CBO and stored into the CBO. The 
PE_UPM_Bytes_Per_Page variable is stored within the E_ProgramAux index class and placed within 
the CBO at CBO create time. 

UPM Page Count = UPMBytes / PE_UPM_Bytes_Per_Page 

Equation 4. UPM Page Count Calculation 

Once the page count is computed, the pricing information can be derived. Pre-published 
content pricing is determined by first multiplying the estimated page count with the price per page 
value. 

Estimated Price = Pages x PE_Source_Price_Per_Page 
Equation 5. Estimated Base Price Calculation 

Note that UPM pricing is calculated at the CBO level. Therefore, the price for an UPM Section 
is never calculated. However, the UPM Section Page Counts are calculated every time the CBO Page 
and Prices are calculated. 
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Next, the UPM cost is calculated and added to make a total publication cost. The UPM 
calculation is based on a base charge for all books containing UPM content. This base charge is the 
minimum cost added to a book containing UPM. 

If a CBO contains UPM then the UPM estimated page count (UPM_Page_Count) is used to 
5 select the equation to be used. When the UPM Page Count is greater than the UPM Base Size 

(PE_Chars_PerJJPM_Tier) perform the calculation in Equation 5. Otherwise, use the calculation in 
Equation 6. PE_Chars_Per_UPM_Tier is stored within the Program Index Class and stored within the 
CBO at creation time. 

10 Use the greater of the minimum CBO cost (PE_Min_Order_Price) and the CBO 

Estimated Price. 

j* If UPMBytes > 0, then UPM_BaseL_Charge = PE_Base_UPM_Fee; otherwise 

ffl UPM_Base_Charge = 0; 

'"•fkz- 

X CBO Estimated Price = PE J3ase„Cust_Pub_Price + 

Ull 5 X 1 " n( (Pages) x PE_Source_Price_Per_Page + PE_Base JJPM_Fee + ((UPMBytes - 

0 PE_Chars_Per_UPM_Tier) / PEJJPM_Bytes_Per_Page) x PE_lncrJJPM- Fee 

!; y 

Equation 6. Estimated Price Calculation with UPM greater than base 

If UPMBytes > 0, then UPM_Base_Charge = PE_Base_UPM_Fee; otherwise 
UPM JBase_Charge = 0; 

20 CBO Estimated Price = PE„Base_Cust_Pub_Price + 1 1 - 1 (Pages) + PE_Base_UPM_Fee 

Equation 67. Estimated Price Calculation with UPM less than base 

Example 

25 The Page and Price count example consists of a Custom Book Outline (CBO) entitled Student 

Loans. Referring to Fig. 19, two pre-published sections 201, 202 are added to the CBO resulting in the 
HTML table of contents view shown. 



STL000023US1 



-95- 



In the above example, the ECBOGet function was called at the conclusion of adding the two 
pre-published sections (ECBOAddContent). The CBO calculations are shown below. 

Section 1.1 Characters = (40 + (2 x 50) + (3x10)) = 170 

Section 1.1 Pages =170/290 = 0.586 = 0.6 rounded (203) 

Section 1.1 Estimated Price = 0.586 x. 10 =.0586 = .06 rounded (204) 

Section 1.2 Characters =(70 +(3x50) + (7x 10)) =290 

Section 12 Pages =290/ 290= 1.0 (205) 

Section 1.2 Estimated Prices =1.0x.10 = .1 (206) 

After calculating the newly added section page and prices, the chapter's page and price 
calculations can be performed This consists of the summation of it's contents. For chapter one the 
calculation is: 

Chapter 1 Pages = 0.6+ 1.0 = 1.6 
Chapter 1 Estimated Price =.06+.1 = .16 

Next, the volume page and price is calculated by summing the chapter values and adding the 
cost associated with the front matter and back matter (i.e. index). In this case, volume 1 contains only 
one chapter, 1 . The page and price counts for the volume are shown below. It assumes the page and 
costs of the front and back matter are: 15 and 5 respectively. 

Volume 1 Pages =1.6+15 + 5 = 21.6 (207) 
Volume 1 Estimated Price = .16+ 1.5+.5=2.16 (208) 

Lastly, the book's values are calculated by summing all the volume values. In this case, there is 
only one volume. The book's values are therefore the same as the Volume 1 page count and estimated 
price. 
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5. Volume Boundaries Determination 

One of the inherent properties of a CBO is a maximum number of pages allowed to exist 
in a Volume. This volume page count limitation is retrieved from the ProgramAux index class 
5 and stored into the CBO when it is created. It is that value, stored in the CBO, that is used to 
define the volume boundaries for the life of that CBO. Several operations that the user can 
perform, including moving, relocating, adding, and deleting content, will create a need for the 
volume page counts for that CBO to be examined. Alterations to the layout of the volumes to 
accommodate Page Limits will occur automatically by this routine with the following conditions: 

10 

All changes to volumes are done with a move up or move down. 
Added/Updated UPM can never have a higher page count than the Volume 
Page Limit 

Added/Updated Chapters can never have a higher page count than the Volume 
Page Limit 

If a Chapter is moved from one Volume to another, the Chapter is moved in its 
entirety. No "Chapter Splitting" is permitted. 

After content is moved from one volume to another, the sequential order of 
content in the table of contents for the CBO should be exactly as before the 
move occurred. 

A new volume is created if a volume page limit is exceeded and there are no 
volumes below that volume in which to move the exceeding content. 
Content can be moved up if there is enough space created in a preceding 
volume to fit the first piece of content from the next volume. 
If a volume no longer contains any content, it is removed. 

ReorganizeCBO Routine 

30 The reorganizeCBO routine is an internal function not defined in the API layer. It is used 

by other public functions (i.e. ECBOAddUPM, etc.) to facilitate volume management. This 
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routine is responsible for managing the volume page limits of a CBO. This routine accomplishes 
this task in the following steps: 



1) Retrieve this CBO's volume page limit from inside the CBO 

2) For each volume in the CBO do the following: 

a) Calculate the number of pages in a volume X 

b) If the number of pages in volume X is greater than the limit 

i) Find the last piece of content in volume X 

ii) Make sure the size of the last piece of content is less than the 

Volume Page Limit 

iii) Get the next volume, X+l , in the system, or create a new one if 

there isn't one 

iv) Place the last piece of content at the head of volume, X+l 

v) recalculate the page counts for volume X and X+l 

vi) Re-call (a.k.a. recursion) the reorganizeCBO routine from the 

beginning to see if moving the last piece of content from 
volume X+l to volume X has balanced everything out. 

a) Check to see if volume X - 1 has enough space to move the first piece 

of content from volume X up into volume X - 1. If volume X - 1 does 
not exist, this step and substeps are skipped. 

i) Place the first piece of content from volume X at the end of 

volume X - 1 

ii) Recalculate the page counts for both volume X - 1 and X 

iii) Re-call (a.k.a. recursion) the reorganizeCBO routine from 

the beginning to see if moving the first piece of content up 
a volume has balanced everything out. 

a) Check to see if the volume X is empty. If so, it can be deleted. 

b) Recalculate the page and prices for the entire CBO and then go back to 
step a) using the next volume in the CBO. When all volumes make it 
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through step e) the CBO is balanced and the pages and prices for the 
CBO have been recalculated and the routine is finished. 

Referring to Fig. 20A, suppose we have a CBO entitled, Engineering, and the volume 
page limit is 45 pages. This is a rather small volume page limit, but it is suitable for this 
example. This CBO contains one chapter 221 entitled, Professionalism and Codes of Ethics. 
The page count 222 for this CBO is 36.6 pages. It remains under the 45 pages per volume limit 
therefore only one volume 223 exists. 

Referring now to Fig. 20B, assume that we want to add a second chapter 224, 
Understanding Ethical Problems, consisting of 1 1.9 pages. Since 36.6 (chapter 1 page count 
including volume front and back matter) + 1 1.9 (chapter two page count) is greater than 45, a 
new volume 225 must be created to hold this second chapter. This new volume 225 contains 
front matter and back matter 154 of its own. This additional front and back matter 154 results in 
a larger final page count 226 of 68.5 for both volumes. 

Suppose we would like to move Chapter 2, section 2.1 227 entitled "Introduction" into 
Chapter 1. This can be done because section 2.1 contains only 0.3 pages and there are over 4 
pages of extra room left in volume L The results of this move are shown in Fig. 20C. 

6. Prerequisite Checking 

One of the tasks performed during the process of adding and deleting content involves 
validation of prerequisite pre-published content. Prerequisite content is some pre-published material, B, 
that is required within a CBO when a piece of material, A, is included in the CBO. The location of B 
and A within the CBO is the xesponsibility of the add and delete routines. The focus of this discussion 
is to determine the required content for a piece of material, A. 

The Business Problem 

Often times within the publishing industry, a piece of content requires other content to be 
included. For example, a section discussing Calculus integration requires a section outlining the 
integration homework problems. The Calculus integration section is known as the requiring section 
and the homework problems is the section being required. It is important to note prerequisite content is 
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only definable at the section level and therefore precludes chapters and volumes from being prerequisite 
material. These required sections are most easily understood in the form of business rules. 

The table below shows six prerequisite business rules. The "Requiring Resource" column 
defines the resource requiring prerequisite material. The "Required Resource List" column defines the 
5 prerequisite material required. The term "Case" is equivalent to a Section, i.e., both are leaf nodes in 
the hierarchical model. The "Requirement Type" column is the type of the requirement. It can be 
either "one" or "all". If it is "one", then only one of the required resources needs to be placed into the 
QBO. If the CBO, contains none of the required resources then the first required resource in the list is 
included. In the event that the type requirement is "all", all required resources must be placed into the 
10 CBO. 

The "Program Restriction" column defines the program restriction type. This column coupled 
with the "Programs to Include or Exclude" column defines the program context of the prerequisite 
Jfj material. The value of this column can be either "include" or "exclude". If it is "include", then the 
;Jf requiring rule only applies to those programs listed in the "Programs to Include or Exclude" column. If 
Oil 5 it is "exclude", then the requiring rule only applies to those programs not listed in the "Programs to 
" Include or Exclude" column. "All" specifies all programs rather than having to explicitly define each 
! 5 f one. 

I y The 'Temporal Restriction", "Effective Start Date" and "Effective Termination Date" columns 

J-i define a time context for the requiring rule. A time context states for the rule to either apply only during 
^ J 20 a specified time period or not to apply at all during the specific time period. The "Effective Start Date" 
and "Effective Termination Date" columns define the start and terminate dates for a rule. 
The 'Temporal Restriction" column can contain two values: "includes" and "excludes". If the value is 
"includes", the rule is to be applied only during the time period outlined by the "Effective Start Date" 
and "Effective Termination Date" columns. If the value is "excludes", the rule is not to be applied 
25 during the time period outlined in these columns. 
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Requiring 
Resource 


Requirement 
Type (all or one) 


Required 
Resource List 


Program 
Restriction 
Includes or 
Excludes 


Programs to 
Include or 
Exclude 


Temporal 
Restriction 
Includes or 
Excludes 


Effective Start 
Date 


Effective 

Termination 

Date 


Section 3 


ALL 


Section 2 


N/A 


N/A 


N/A 


N/A 


N/A 


CaseD* 


ONE 


Case B, Case C 


N/A 


All 


N/A 


N/A 


N/A 


Section 4 


ALL 


Section 3 


N/A 


All 


Includes 


May 1998 


May 1999 


Section 3 


ALL 


Section 2 


N/A 


All 


Excludes 


May 1999 


September 
1999 


Case D* 


ONE 


Case B, Case C 


Excludes 


Business and 
Economics 


Excludes 


May 1999 


September 
1999 


CaseB* 


ALL 


Case A 


Includes 


MIS 


N/A 


N/A 


N/A 



In the business rule represented by row 1 , the requiring resource is Section 3 of a prepublished 
book and requires section two. Since the "Program Restriction" and "Programs to Include or Exclude" 
columns are marked "N/A", this rule is to be applied to all programs. In addition, it is to be applied all 
the time since no start and end dates are specified. The business rule in row 4 also pertains to Section 3. 
The rules of rows 1 and four are similar with the exception of the applicable time period. In this 
situation, section 2 is included prior to May 1999 and after September 1999. Thus, rules can be 
combined to form more complicated rules relationships. 



A Solution 

To apply the above rules to the system of the present invention, a few assumptions have been 
made. First, it is assumed the data model previously defined for pre-published content cannot be altered 
significantly to incorporate these changes. Second, when a piece of content is added or deleted, it is 
assumed the corresponding function handles the location of the material within the CBO. Third, it is 
assumed this solution must only define the applicable business rules for defining prerequisite material. 

A challenge posed by the business rules shown above is in the area of program and time period 
exclusions. This is due to the fact exclusion and inclusion rules can often conflict with one another. In 
the present embodiment, the defining of rules has been limited to "positive" rules to reduce the 
possibility of exclusion time periods. In other words, in the example described previously regarding 
section 3 requiring section 2, the 1 st and 4 th rules can be replaced with the following two rules. 
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Requiring 
Resource 


Requirement 
Type (all or one) 


Required 
Resource 
List 


Program 

Restriction 

Includes 


Programs to 
Include 


Temporal 

Restriction 

Includes 


Effective Start 
Date 


Effective 
Termination Date 


Section 3 


ALL 


Section 2 


N/A 


N/A 


Includes 


Sept 1977 


May 1999 


Section 3 


ALL 


Section 2 


N/A 


N/A 


Includes 


May 1999 


September 1999 



10 



In an effort to reduce the possibility for conflicting rules, the negative or exclusive rules have 
been replaced with positive rules. Li addition, the required resource list is only permitted to have one 
section per rule (row). This implies that the requirement type is always one and can therefore be 
eliminated from the grid. Similar to the resource column, the "Programs to Include" column is only 
permitted to have one program per rule (row). Lastly, it is assumed the program is explicitly stated. 
The grid below shows a simplified grid as a result of making these assumptions. 



Requiring 
Resource 


Required Resource 


Program to Include 


Effective Start 
Date 


Effective 
Termination Date 


Section 3 


Section 2 


Business Economics 


Sept 1977 


May 1999 


Section 3 


Section 2 


Business Economics 


Sept 1999 


Sept 2099 



15 



The following grid shows another example involving the two rules (rows 2 and 5) for Case D 
defined in the first table. The 5 th rule has been converted into four rules (rows 1-4, below) and the 2 nd 
rule has been converted into two rules (rows 5 and 6, below). 



Requiring 
Resource 


Required Resource 


Program to Include 


Effective Start 
Date 


Effective 
Termination Date 


CaseD 


CaseB 


Business Economics 


N/A 


N/A 


CaseD 


CaseC 


Business Economics 


N/A 


N/A 


CaseD 


CaseB 


Business Economics 


Sept 1997 


May 1999 


CaseD 


CaseB 


Business Economics 


Sept 1999 


Sept 2099 


CaseD 


CaseC 


Business Economics 


Sept 1997 


May 1999 


CaseD 


CaseC 


Business Economics 


Sept 1999 


Sept 2099 



A rule collision exists in the preceding table for the program business economics between the 
20 dates of May 1999 and September 1999. During this time, rules 3 - 6 do not apply. However, rules 1 
and 2 do apply since they do not have a defined start and termination time period The result is that 
Cases B and C are prerequisites at all times (even between May 1999 and September 1999). Prior to 
May 1999, rules 3 and 4 are exercised. Between May 1999 and September 1999, rules 1 and 2 are 
exercised and after September 1999 rules 5 and 6 are used. The effect desired is for no rules to be 
25 exercised between May 1999 and September 1999. This can be accomplished by removing rules 1 and 
2. At the present time, this problem has a low likelihood of occurring and is thus left to the system 
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administrator to manage. The system administrator is informed this is an erroneous situation and is 
requested to remove rules 1 and 2. 

The prerequisite content is defined at the time a pre-published book is loaded. An editor 
identifies the prerequisite content for a pre-published book and defines it in the format defined in the 
grid below. This format is placed into a PSF/ATR file and loaded into DL using the same facility that 
loaded the pre-published books, batch load facility. 



BNF Grammar Construct 


Value 


<prerec] content> 




<relation> 


RELATION: <sequence id> 
[ <prereq attr group> ]+ 


<prereq attr group> 


IPREREQ 

!KIND: <kind> 
1PROGRAM: <program id> 
1REQUIREDID: <sequence id> 
ISTARTDATE: <date> 
IENDDATE: <date> 
[DESCRIPTION: <description> 


<kind> 


Requires 


<program id> 


A 2 letter string 


<sequence id> 


XXXXXXXXXX.CC.SS.OO where X is the 
ISBN number; C is the chapter number; S is 
the section number. 


<date> 


mm/dd/yy where mm is the month; dd is the 
day and yy is the year. 


<description> 


A text description to the end of line 



Suppose Section 3's ISBN number is 0234567891.01.03.00 and Section 2's ISBN number is 
0343434343.02.02.00. The following grid defines two sample rules. The following PSF/ATR file 
format represents these two rules. 



Requiring 
Resource 


Required Resource 


Program to Include 


Effective Start 
Date 


Effective 
Termination Date 


Section 3 


Section 2 


Business Economics 


Sept 1977 


May 1999 


Section 3 


Section 2 


Business Economics 


Sept 1999 


Sept 2099 



RELATION: 0234567891.01.03.00 
IPREREQ 

!KIND: Requires 

IPROGRAM: FE 

IREQUIREDID: 0343434343.02.02.00 
ISTARTDATE: 09/01/1977 
IENDDATE: 05/01/1999 
IDESCRIPTION: Rule One 
IPREREQ 
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!KIND: Requires 
{PROGRAM: FE 

IREQUIREDID: 0343434343.02.02.00 
ISTARTDATE: 09/01/1999 
IENDDATE: 09/01/2099 
[DESCRIPTION: Rule Two 



A group of configuration files are used in defining the DL data model to PSF/ATR file format 
mappings. In the case of prerequisite information, the DL data model consists of the following two 
index classes. The index class attribute to PSF/ATR file mappings are also provided. 



Prereq Index Class 



SeqID 


SeqJD 


ExLAIpha [32] INDEXED 


EntityType 


PSF 


Ext.Alpha[321 


Parentltem 


Program generated 


ExLAIpha [161 


Siblingltem 


Program generated 


ExLAIpha [161 


Childltem 


Program generated 


ExLAIpha [161 


Auxltem 


Program generated 


ExLAIpha [16] 


Subcompltem 


Program qenerated 


ExLAIpha [16] 


Program ID 


PEJD 
AC PE ID 


Alpha [4] 



PrereqAux Index Class 



SeqID 


PSF 


Ext. Alpha [321 


Prereqitem 


Program qenerated 


Ext. Alpha [161 


Parentltem 


Program generated 


Ext. Alpha [161 


Siblingltem 


Program generated 


Ext. Alpha [16] 


Childltem 


Program generated 


Ext. Alpha [16] 


Keyword 


Relation 

Kind 

RequiredlD 
StartDate 
EndDate 
Description 


Alpha [32] 


Value 


ATRtile 


Ext. Alpha [2541 


NextValueltem 


Program generated 


Ext. Alpha [16] 



Once the data is loaded into these index classes, it can be queried by the add and delete 
functions. Each time the add and delete functions need to determine the applicable rules, a query is 
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made to the PreReq index class. A sample query which searches for those rules applicable to a section, 
0562423452.03.02.00, is shown below. 

EntityType == RELATION 
5 Kind == 'Requires' AND 

Program == 'FE' AND 
Seq_ID== '0562423452.03.02.00' AND 
StartDate <= 'today's date' AND 
EndDate >= 'today's date' 

10 

The results from the query are the list of required sequence identifiers for the specified 
requiring sequence identifier. With the list of requiring ids identified, the add, and delete 
operations can manipulate the sections appropriately. 

Ol5 7. Rights Management (Mutual Exclusiveness) 

s £ One of the tasks performed during the process of adding content involves validation of rights 

% managment. Rights management consists of performing mutual exclusive checks on content prior to 

5 adding it to a CBO. Mutual exclusive content is some pre-pubhshed material, B, that is not permitted in 

U] the same publication as some piece of material, A. The location of B and A within the CBO is the 

1^20 responsibility of the add routine. The focus of this discussion is to determine the mutually exclusive 

^; content for a piece of material, A. 

The Business Problem 

At times within the publishing industry, a piece of content is not permitted to appear in the same 
25 publication as another piece of content. For example, two authors, Joe and Sarah, have a dislike for one 
another and have stated they do not allow their works to be published together. To prevent this 
situation, a check is performed when adding pre-published content authored by Joe to a CBO to make 
sure the CBO does not already contain pre-published content by Sarah. 

The described problem requires business rules for pre-pubhshed content to be defined in the 
30 datastore. These business rules are in the form of: content A cannot exist with content B . Due to the 
term "cannot" in these business rules, the rules are considered negative business rules. This is in 
contrast to prerequisite checking, where positive business rules of pre-published content inclusion 
applied. In the event one of the negative business rules are true, the pre-published content is not added 
to the CBO. 
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Each negative business rule consists of two pieces of information: the pre-published content, A, 
being added and the pre-published content, B, not permitted to exist with A. One or more negative 
business rules can be defined for a pre-published content A having different pre-published content B 
specified. 

The following grid illustrates two negative business rules. In the first rule, pre-published 
content, A, is not permitted to be added to a CBO containing pre-published content, B. In the second 
rule, pre-published content, A, is not permitted to be added to a CBO containing pre-published content, 
C 



Prepublished Content to be Added 


Prepublished Content Not Permitted to 
Exist Within CBO 


A 


B 


A 


C 



A Solution 

To apply the above negative rules to the system of the present invention, a few assumptions 
have been made. First, it is assumed the data model previously defined for pre-published content 
cannot be altered significantly to incorporate these changes. Second, when a piece of content is added, 
it is assumed the corresponding function handles the location of the material within the CBO. Third, it 
is assumed that this solution must only define the applicable business rules for defining mutually 
exclusive material. 

Unlike the pre-requisite material, mutually exclusive material does not have a time period 
associated with it. A mutually exclusive rule is assumed to be in effect at all times. Since no time 
period is defined for the rules, contradictory rules cannot exist. Thus the data model representing these 
rules is simpler than that for pre-requisite material rules. 

The mutually exclusive content is defined at the time a pre-published book is loaded. An editor 
identifies the mutually exclusive content for a pre-published book and defines it in the format defined in 
the grid below. This format is placed into a PSF/ATR file and loaded into DL using the same facility 
that loaded the pre-published books, i.e., the loader 14 batch load facility. 
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BNF Grammar Construct 


Value 


<mutually exclusive content> 


<relation> 


<relation> 


RELATION: <sequence id> 
[ <prereq attr group> ]+ 


<prereq attr group> 


EXCLUSION 

IPROGRAM: <program id> 
IriAL^-LUL'EiiJLJ. < sequence iu^> 
!DESCRIPTION: <description> 


<program id> 


A 2 letter string 


<sequence id> 


XXXXXXXXXX.CCSS.OO where X is the 
ISBN number; C is the chapter number; S is 
the section number. 


<description> 


A text description to the end of line 



Suppose a Section 3 has an ISBN number is 0234567891.01.03.00 and a Section 2 has an ISBN 
m 5 number is 0343434343.02.02.00. Both sections have been selected for addition to a CBO. The 

'5 following grid defines two sample rules. The first rule states to not permit pre-published content, 
m Section 3, to be added to a CBO containing pre-published content, Section 2, for a program, Business 
M Economics. The second rule states to not permit pre-published content, Section 2A, to be added to a 
Hi CBO containing Section 3C, for the program, Freshmen Engineering. 



Prepublished Content to Add 


Mutually Exclusive 
Pre-Published Content 


Program to Include 


Section 3 


Section 2 


Business Economics 


Section 2 


Section 3 


Freshmen Engineering 



The following PSF/ATR file format represents these two rules. 

15 RELATION: 0234567891.01.03.00 

[EXCLUSION 

IPROGRAM: BE 

IEXCLUDEID: 0343434343.02.02.00 
IDESCRIPTION: Rule One 
20 RELATION: 0343434343.02.02.00 

EXCLUSION 

IPROGRAM: FE 

IEXCLUDEID: 0234567891.01.03.00 
IDESCRIPTION: Rule Two 

25 

Assuming only the preceding two rules exist in a system, adding Section 2 to a CBO containing 
Section 3 within the Business Economics program would be permitted, contrary to the intended result. 
The following reciprocal rule can be generated to eliminate this situation. 

30 
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Prepublished Content to Add 


Mutually Exclusive 
Pre-Published Content 


Program to Include 


Section 2 


Section 3 


Business Economics 



10 



Therefore, each rule excludes a section of material, A, from being added to a CBO containing 
another section of material, B. If it is desired to have material A exclude material B and material B 
exclude material A then two business rules are necessary. One rule for each exclusion. 

As previously described, group of configuration files are used in defining the digital library 20 
data model to PSF/ATR file format mappings. In the case of rights management information, the data 
model consists of the following two index classes. The index class attribute to PSF/ATR file mappings 
are also provided. 

Rights Index Class 





SeqID 


SeqJD 


Ext. .Alpha [321 INDEXED 




Auxltem 


Proqram qenerated 


Ext..Alpha[16l 




ProgramID 


PEJD 


Alpha [4] 






AC PE ID 





RSghtsAux Index Class 



15 



20 



SeqID 


PSF 


Ext. Alpha [321 


Rightsltem 


Program generated 


Ext. Alpha [16] 


Siblingltem 


Program generated 


Ext. Alpha [161 


Childltem 


Proqram generated 


Ext. Alpha [161 


Keyword 


Relation 

Description 


Alpha [32] 


Value 


ATRfile 


Ext. Alpha [2541 



For each relation defined in the PSF files, a row is added to the Rights index class. For each 
exclusion within the relation, a row is added to the Rights_Aux index class. All the exclusions for a 
relation are ordered prior to loading into these index classes. The unique identifier of the first 
exclusion's row is placed into the Auxltem column of the relation's Auxltem column within the Rights 
index class. The Siblingltem column of each exclusion contains the unique identifier of the next 
exclusion within a relation. The last exclusion for a relation contains nothing in the Siblingltem field. 
Once the data is loaded into these index classes, it can be queried by the add function. Each time the 
add function needs to determine the applicable rules, a query is made to the Rights index class. A 
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sample query which searches for those rules applicable to a section, 0562423452.03.02.00, is shown 
below. 

EntityType = RELATION 
ProgramID == 'FE' AND 
Seq_ ID = '0562423452.03.02.00' 
Return the Auxltem column 

The results from the query are the values within the Auxtltem column that are required 
sequence identifiers for the specified excluded sequence identifier. The Rights Aux index class 
can be further queried to determine those sequenced ids of the excluded material. With the list of 
exclusion's ids identified, add operations can manipulate the sections appropriately. 

8. Searching for Content 

As mentioned previously, there are two methods for selecting content to include in a 
custom book outline. The first is to select content from a bookshelf. The second is to search the 
content by querying the digital library 20 based on user-provided search terms. Referring back to 
Figs. 3 and 6, the search capability is defined by the search support 52 in the underlying digital 
library 20. 

In the present example, the IBM Digital Library includes a parametric search capability 
for searching attributes defined in the data model, and a text searching capability (through the 
IBM Intelligent Miner for Text) for searching the content itself. It will be understood that no 
search capability is required for the present system to be functional. Other types of search 
capability may be provided in addition to, or in place of the search capability described. For 
example, the IBM Digital Library is also able to provide legacy searching capability, search 
gateways, and Query by Image Content. 

The search capabilities present are made available to the user through anEQuery 
application in API Layer 30 communicating with the digital library 20. TheEQuery API and 
application layers provide the ability to perform hierarchical searches against digital library, 
which is a non-hierarchical data store. This is achieved by the way the data is stored in the entity 
groups: primary/auxiliary. All hierarchical levels of a pre-pub book are stored within the same 
digital library index class. Therefore, one or more simple digital library/Intelligent Miner for 
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Text queries on these primary/auxiliary index classes are all that are needed to perform 
hierarchical searches. 

An interesting challenge still remains - merging the results of individual queries 
performed on different hierarchical levels. This problem is addressed using "hit masks", and will 
5 be discussed in more detail subsequently. 

EQuery as implemented in the present embodiment includes two formal interfaces: One is 
internal and available directly as a native language function call, i.e., a Search function residing 
in application layer 28. The other is external and available via the command line and is used 
primarily for testing and diagnostics. The native language call uses memory buffers for the input 
10 search criteria and output search results. The command line version uses files for the input 

search criteria and output search results. Both interfaces use the same search criteria format and 
\Q search results format according to the following rules: 

IB 

Search Criteria Specification 

015 

General Rules 

!h? 1 . Every line is terminated by a newline charater. 

Ill 2. Each line that begins with a semicolon in the first column will be ignored. It may be used to start 
j;3 some comments. 

M20 3. Search Criteria structure consists of one or more Individual Queries. 

4. The results of all the Individual Queries will be merged by entities to form one single Search Results 
structure. 

Individual Query Rules 

25 1 . Each line that begins in the first column (assuming not a semicolon) is the beginning of a new 
Individual Query. 

2. The scope ENTITY indicates that the query scope will be the particular entity type specified by the 
Search Level. Only those entity types listed in the group attribute files (* atr files) are valid. 
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2. The scope GROUP indicates that the query scope will be all the entity types within one particular 
data model group specified by the Search Level. Only those data model groups listed in the groups file 
are valid. 

3. The Return Attribute List is a list of attributes to be returned for each hit. The list is optional. If the 
5 list is left out, only Entity Type and Sequence-ID for each hit will be returned. Only those attributes of 

type ENTITY or type SYS are supported in the Return Attribute List. If an attribute specified is not 
found or its attribute value is NULL for a particular hit, the attribute is simply ignored with no errors. 

4. The body of an Individual Query may contain one Text Search Expression, or one Parametric 
Expression, or one Text Search Expression followed by one Parametric Expression. 

10 

j!t% Text Search Expression Rules 

*J3 1 . A line that begins with exactly one tab followed by TEXT= is the beginning of a Text Search 

:f"' : 

135 Section. 

)S 2. The Text Attribute following TEXT= indicates indirectly which text index in TextMiner to query on. 
Ol 5 Only those attribute defined to be associated with a text index are valid. 

r 3. The body of a Text Search Section may contain one Free Text term, or one Text Search Expression, 

f: or one Free Text term followed by one Text Search Expression. 

HI 4. The valid Boolean Operators are AND (set intersection) and OR (set union). 

b 5. The valid Text Search Types are EQ (contained in document), NE (not contained in document), II (is 

^20 in document), NI (is not in document). 

6. Since the text index type chosen, Ngram, does not support Free Text, the Free Text term is provided 
only for backward compatibility purpose. The Free Text search string is always converted internally to 
multiple search words grouped by OR. 

7. A line may not have more than one additional tab from the previous line, if the previous line contains 
25 a Boolean operator. A line may not have more tabs than the previous line in all other cases. 

8. A line with fewer tabs than the line previous to it is a sibling of the most recent line with the same 
number of tabs as the current line. 
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Parametric Expression Rules 

L A line that begins with exactly one tab that is not followed by TEXT= is the beginning of a 
Parametric Section. 

2. The valid Boolean Operators are AND (set intersection) and OR (set union). 
5 3. The valid Parametric Types are EQ (equal), NE (not equal), H (is in), NI (is not in). 

4. The Parametric Value NULL is only valid when the Parametric Type is either EQ or NE. 

5. The Parametric Attribute indicates which column in the underlying database to query on. Only those 
attributes defined as mapped to a DL attribute are valid. 

6. A line may not have more than one additional tab from the previous line, if the previous line contains 
1 0 a Boolean operator. A line may not have more tabs than the previous line in all other cases. 

7. A line with fewer tabs than the line previous to it is a sibling of the most recent line with the same 
I number of tabs as the current line. 



01 
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EQuery BNF Grammar 



Search_Criteria := <Indiviual Query> 1 {<IndiviuaTQuery>]* 

Individual Query := <Query_Header> <Query JBody> 

Query Header := <Scope> =<Search_Level> :<Retum_Attribute_List> \n 

Scope := ENTITY II GROUP 

Search_Level := <Entity_Search_Level> II <Group_Search_Level> 

Entity_SearchJLevel := Name of any entity type defined in the 
group attribute file 

Group_Search_Level := Name of any data model group defined 

in the groups file _ 

Return_Attribute_List := <Attribute>[,<Attribute>l* 

Attribute := <Parametric_Attribute>ll <System_Attribute> 

Parametric_Attribute := Name of a DIGITAL LIBRARY attribute 
defined in a group attribute file 

System_Attribute := Name of an attribute defined in a group 

attribute file generated by system 

Query Body := <Text_Search_Section> I <Parametric_Section>] II 

<Parametric_Section> 

Text_Search_Section := \tTEXT=<Text_Attribute> 
\n<Text_Search_Option> 

Text Attribute := Name of an attribute defined in a group attribute 
file with a text index 

Text_Search_Option ;= 

<Text_Search_FreeText>[<Text_Search_Expression>] II 

<Text_Search_Expression> __ 

Text_Search_FreeText := \t\t<String>JT\n 

Text_Search_Expression := <Text_Search_Term> II 

<Text_SearchJ3oolean_Expression> 

Text_Search„Term := <TS_Tab_Level> 

<Text_Search_Argument>,<Text Search_Type>\n 

TS„Tab_Level := The level/sublevel of tab 

nesting 

Text_Search_Argument := "<String> M 

String :- Any search string 

Text_Search_Type := EQ II NE II n IINI 

Text_Search_Boolean_Expression := 
<TS_Tab„Level> 

<Boolean_Operator>\n<Text_Search_Expression> 

[<Text_Search_Expression>] * 

Boolean_Operator := AND II OR 

Text_Search_End_Term := 

<Text_Search__Term><Text_Search_Term> 

Parametric Section :=\t<Parametric_Expression> 
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ParametricJLxpression := <Parametric_Term> II 

<Parametric_Boolean_Expression> 

ParametricJTerm := 

<PM_Tab_Level><Parametric_Value>,<Parametric_Type> 

<Parametric_Keyword>\n 

PM_Tab_Level := appropriate number of tabs for 

this level 

ParametricJValue := "<String>" II NULL 

ParametricJType := EQ II NE II II IINI 

Parametric_Keyword := "<Parametric_Attribute>" 

Text_Search_Boolean_Expression := <PM_Tab_Level> 
<Boolean_Operator>\n <Parametric _Expression> 
[<Parametric_End_Term>] * 



Interface Specification 

Native Language Call Arguments 
Linkage: 

int EQuery( char* pCiiteriaBufferCriteria, 

char** ppResultsBuffer); 

pCriteriaBuffer points to a null-terminated buffer containing the search criteria. PResultsBuffer 
points to another pointer, which will point to a null-terminated buffer containing the search results. 

The buffer needed to hold the results will be dynamically allocated, and *ppResultsBuffer will point 
to the new buffer. If there is no results, the results buffer is allocated, but is empty. If an error is 
encountered, the results buffer is not allocated, leaving *ppResultsBuffer set to NULL. 

Command Line Arguments 
Syntax: 

EQuery <query file name> <results file name> 

<query file name> names a file containing the search criteria as described in 
AppendixA 

<results file name> names a file where the search results as described in Appendix 
B should be placed. 



Example: 

EQuery 01 2345678.qry 01 2345678.res 

This will launch EQuery using the file called "012345678.qry" to get the query search criteria. The 
hit results will be stored in the file "012345678.res'\ 
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To initiate a search through the native language call, the user inputs search terms via a 
static HTML search screen displayed in Fig. 22 The search screen is invoked by clicking on the 
"Search Contents" button 240 provided on the vertical menu bar of the Bookshelf page (see Fig. 
1 1). An exemplary search screen is shown in Fig. 21 A. The search screen includes input fields 
appropriate for the search capabilities supported. The "Chapter Titles" field 232 and a "Section 
Titles" field 233 of the present example are provided in support of parametric searching of the 
Title attribute defined in the Product Index Class. These fields are a matter of design choice and 
may be changed to specify any or all attributes defined in the data model. The "Index Words" 
field 231 is provided for entering text search terms in support of free text searching. By design 
choice, the search criteria entered into these fields will be ANDed together, as indicated by the 
"AND's" between the fields. However, other Boolean operators may be supported Full Boolean 
logic is preferably supported within all fields. 

Assume that the user is interested in finding chapters with the string "Engineering" in the 
titles, and the chapters contain sections with either "engineering" or "study" or "skills" in the 
titles, and the word "study" or "skills" is one of the index terms for the sections. To that end, the 
user has entered the search terms "study OR skills" into the text search field 231, "Engineering" 
into chapter title field 232, and "engineering OR study OR skills" into section title field 233, as 
shown in Fig. 21 A. By clicking on any "GO" button 234, the user invokes the Search procedure 

The search procedure receives the search terms and formats it for delivery to the EQuery 
search application. Input criteria from each field are treated as separate queries. Below is an 
example of the formatted search criteria, passed as the input parameter to Equery. 
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ENT I TY= CHAPTER . C : Hits , Rank, Title , HitWords 
AND 

"engineering" , II, "Title" 
"FE" ,EQ, "PE_ID" 
"1" ,EQ, "Status" 
ENTITY=SECTION: Hits , Rank, Title , HitWords 
AND 



OR 

"engineering" , II, "Title" 
"study" ,11, "Title" 
"skills" ,11, "Title" 
"FE" ,EQ, "PE_ID" 
"1" ,EQ, "Status" 
ENTITY= SECTION : Hits , Title , Rank, HitWords 
TEXT=Index_Term 
OR 

"study" ,11 
"skills" ,11 



AND 



"FE" ,EQ, "PE_ID" 
" 1 " , EQ , "Status" 



Additional parametric constraints may be appended at this time to improve the efficiency of the 
search. For example, the search criteria above have been ANDed with the user's program ID 
(PEJD) and a status value equal to "1" (indicating that the prepublished text is available for 
use). In addition, the chapter title search criteria are ANDed with the constraint of entity type 
equal to chapter, and section title search criteria are ANDed with the constraint of entity type 
equal to section. 

EQuery receives the search criteria and reformats them as necessary for input to the data 
repository. The query input format depends on the search support available. In the present 
example, the query is reformatted for input to the IBM DB2 Digital Library withTextMiner 
support. This involves parsing the search criteria and mapping the attribute names known by the 
web site into the attribute names in digital library as defined by the ELOADER.INI configuration 
file, EQuery sends the following search conditions (as part of the query strings) to the digital 
library 00 API 42. 
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Query 1, Text Search condition: 
NONE 

Query 1, Parametric condition: 

COND= ( ( E„Ent i tyType = = " CHAPTER . C " ) AND 
5 ( (E_Title LIKE " %engineering% 11 ) AND 

(E„ProgramID == "FE" ) AND (E„Status == "1"))) 

Query 2, Text Search condition: 
NONE 

10 Query 2, Parametric condition: 

COND= ( (E_Enti tyType == "SECTION") AND 

( ( (E_Title LIKE "%engineering%" ) OR (E_Title LIKE %study%" ) OR 

(E_Title LIKE " %skills% " ) ) AND 

<E_ProgramID == "FE") AND (E_Status == "1"))) 

15 

Query 3, Text Search condition (conceptually): 

C0ND=( 'study' OR 'skills') 
Query 3, Parametric condition: 

COND={ (E__Enti tyType == " SECTION" ) AND 
;;:J J20 ( (E_ProgramID == "FE") AND (E_Status == "1"))) 

CO The OO API 42 receives this input, then sends the text search to Text Miner through the 

ffl TextMiner API, and the parametric search to the digital library. TheTextMiner API and 
25 indexing requirements are described in the Digital Library Text Search Using TextMiner 
Application Programming Reference, First Edition (September 1997) available from IBM 
h& Corporation. 

: u A set of entities is returned by digital library 20 that matches the search criteria for each 

O individual query. If more than one query is specified, as in the example, then all queries are performed 
30 and the individual result sets are combined via a set union operation and returned as a single result set. 
During the union process, a hit mask is built for the Entity (described in more detail subsequently) to 
indicate which one or more of individual queries returned the Entity. This hit mask is returned with the 
merged result set if the "Hit" term has been specified in the return attribute list The Entities will be 
returned in sequence-ID order. 
35 Each query may list a subset of all possible attributes to be output for each Entity found. Only 

attributes with non-NULL values will be placed in the output. The format of the attributes will be the 
same as the Attribute File as defined in "ATTRIBUTE Definitions" above. Additional system 
generated attributes may be added to the list of attributes for each item found. These may include a 
query-hit mask, text search relevance ranking, and hit words for titles and index terms, for example. 

40 
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Search Results Specification 

The format of the Search Results is a merging of a Product Structure File (PSF) and an 
Attribute File. Each hit from an Individual Query will produce at least one line in the file. The line 
5 includes the Entity Type and the SequenceJD as defined in the original PSF. In addition, if any 

attribute name was specified in the Return Attribute List and the attribute value is not NULL for this hit, 
the attribute name-value pair will be included in a line with a leading exclamation mark. 

Hits, Rank, and HitWords are special system attributes. To place one such attribute in the 
results, that attribute name is included in the Return Attribute list. The Hits attribute is a "hit mask" 
10 comprising a string of bits indicating how may of the individual queries had hits. It is available for each 
Entity in the merged result set. Hit masks are described in more detail subsequently. Every character in 
# the string corresponds to one Individual Query that has been executed. The first character in the string 
i|i corresponds to the first query, the second character in the string corresponds to the second query, and so 
:p on. If the Entity is present in the result set of a given query, that character in the attribute value string 
0 1 5 will be a one. If the Entity is not present in the result set, the character will be a zero. One, more, or all 

s i::: 

y = 

„ (but never none) of the characters in the string will be one. 

The Rank attribute is a number and it is available for only part of the merged result set Rank is 
Hi a number between 1 and 1 00 given by Intelligent Miner for Text to represent the relevance ranking of 
h each hit relative to all hits from one text search query. The attribute is available for each Entity hit from 
u4 20 an Individual Query containing a Text Search Criteria Term. 

The HitWords attribute is available only for part of the merged result set. More specifically, it 
is only available for an Entity hit of type SECTION. When included as part of the Return Attribute list, 
up to two system-generated attributes, section_sectiontitle and section_idx, may be included in the 
results. The section_sectiontitle attribute keeps a list of word hits in a section tide, with a comma as the 
25 word separator. The section_idx attribute keeps a list of word hits in the index terms or key terms for a 
section, with comma as the word separator. The existence of these attributes is for applications to 
determine which words to highlight when the hit document is displayed. To implement these, user 
inputs for querying section title or index terms are parsed AS IS into individual words and stored in the 
attributes. 



STL000023US1 



-118- 



Below is an example of a merged results file returned as the output parameter from EQuery for 
the preceding queries: 



SECTION: 013 011037X. 10. 03. 00 
5 !Hits:010 

! Title : Social Engineering 

! search-sectiontitle: engineering, study, skills 

CHAPTER. C: 013 013 149 0.02. 00. 00 

!Hits:100 

10 I Title: An Introduction to Engineering Problem Solving 
! search-chapterti tie : engineering 
SECTION: 013 013149 0.02 .04.00 
!Hits:010 

[ Title : An Engineering Problem-Solving Methodology 
15 ! search-sectiontitle : engineering, study, skills 
CHAPTER. C: 013 080 8598. 02 .00.00 
!Hits:100 

! Title: Studying Engineering: The Keys to Success 

! search-chaptertitle : engineering 
Q20 SECTION: 0130808598. 02. 02. 00 
C* 1 Hits: 001 

JC 1 Title: Commitment 

ffs [Rank: 24 

"'J ! search-idx: study, skills 

^25 SECTION: 0130808598. 02. 03. 00 

y [Hits: 0 01 

O ! Title : Application 

yl 'Rank: 24 

! search-idx: study, skills 
11*30 SECTION: 0130808598. 02. 04. 00 

I Hits: 001 

! Title : Strategy 
HI ! Rank: 24 

Li: ! search-idx: study, skills 

j**35 SECTION: 0130808598 . 02 . 05 . 00 
!2f [Hits: 001 

[Title : Perseverance 

[Rank: 24 

[ search-idx : study , skills 
40 SECTION: 013 0808598. 02. 06. 0 0 
[Hits:001 

! Title: Associations 
[Rank: 24 

[ search-idx: study, skills 
45 CHAPTER. C: 013 080 8598. 03 .00.00 
!Hits:100 

i Title: Introduction to Engineering and Engineering Study 
1 search-chaptertitle : engineering 
SECTION: 0130808598.03.02.00 
50 'Hits: 011 

[Title: What Is Engineering? 

! search-sectiontitle : engineering, study, skills 
[Rank: 24 

! search-idx: study , skills 
55 SECTION: 0130808598. 03. 03 .00 
[Hits: 001 

[Title: What Do Engineers Do? 
[Rank:22 



STL000023US1 



-119- 



! search-idx: study, skills 
SECTION: 013 0808598. 03. 04. 00 
!Hits:011 

! Title: Why Choose to Study Engineering? 

1 search-sectiontitle : engineering, study, skills 

•Rank:31 

! search-idx: study, skills 
SECTION: 0130808598. 03 .05.00 
!Hits:011 

] Title: Equipping Yourself for Engineering Study 
! search-sectiontitle : engineering, study, skills 
! Rank: 64 

! search-idx: study, skills 
SECTION: 013 08 0859 8. 03. 06. 00 
!Hits:001 

! Title: Cooperative Education Programs (Co-ops) and Internships 
! Rank: 24 

! search-idx: study, skills 
SECTION: 0130808598. 04. 03. 00 
IHits:010 

! Title: Making Effective Use of  the  Engineer ing  Prof essor 
! search-sectiontitle: engineering, study, skills 
SECTION: 013 0808598. 05. 04. 00 
!Hits:011 

! Title: Group Study 

I search-sectiontitle : engineering, study, skills 
!Rank:24 

! search-idx: study , skills 
SECTION: 013 0808598. 05. 06. 00 
!Hits:010 

! Title ; Engineering Ethics and Code of Student Behavior 
! search-sectiontitle : engineering, study, skills 
CHAPTER. C: 013 08 08598. 06.00. 00 
!Hits:100 

! Title: Key Strategies for Maximizing Performance in Engineering Courses 
1 search-chaptertitle : engineering 
SECTION: 013 0 80859 8. 06. 02 .00 
!Hits:001 

! Title: Time Management Strategies 
! Rank: 24 

! search-idx: study , skills 
SECTION: 013 0808598. 06. 03 .00 
!Hits:010 

! Title: Preparing for an Engineering Course: Making Sure Your Prerequisite 
Works 

! search-sectiontitle : engineering, study, skills 

SECTION: 013 08 08598. 06. 08. 00 

IHits:011 

I Title : Using Tutors and Study Guides 

I search-sectiontitle : engineering, study, skills 

1 Rank : 3 1 

I search-idx: study, skills 
SECTION: 013 08 08598. 07. 03 .00 
!Hits:001 

I Title: Preparing for Examinations: Getting Organized 
! Rank: 24 

1 search-idx: study, skills 
CHAPTER. C: 013 0808598 .10 .00.00 
!Hits:100 

i Title : Developing Engineering Skills 
! search-chaptertitle: engineering 
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SECTION: 013 0808598. 10. 02. 00 
!Hits:011 

! Title: Communication Skills 

I search-sectiontitle : engineering, study, skills 
•Rank: 47 

! search-idx: study, skills 
SECTION: 01308 08598. 10. 03 . 00 
!Hits:001 

! Title: Developing Your Creativity 
I Rank: 33 

! search-idx: study, skills 
SECTION: 0130808598. 11. 03 .00 
!Hits:001 

I Title: Continuing Your Studies Towards a Graduate Degree 
! Rank: 24 

! search-idx: study, skills 
CHAPTER. C: 0132 54749X. 02 .00.00 
!Hits:100 

! Title: Engineering and Electronic Worksheets 
! search-chaptertitle : engineering 
SECTION: 013254749X. 02 . 03 . 00 
!Hits:010 

! Title: The Engineering Method 

! search-sectiontitle: engineering, study, skills 

SECTION: 013 254749X. 02. 04. 00 

>Hits:010 

! Title: Issues in Engineering and Data Analysis 
! search-sectiontitle : engineering, study, skills 
CHAPTER. C : 013254749X . 05 .00.00 
!Hits:100 

I Title Engineering Computation 
! search-chaptertitle: engineering 
SECTION: 0132 54749X. 10 . 02 . 0 0 
!Hits:010 

! Title Engineering and the Internet 

! search-sectiontitle: engineering, study, skills 

The results set can contain entities at different levels in the hierarchy. For example, the 
previous search contains entities at the chapter and section levels. The preceding results are 
received by the search procedure in application layer 28, where they may be filtered (e.g., to 
remove any sections that do not belong to chapters that are hits), parsed, and displayed. An 
example of a results screen is shown in Fig. 2 IB. 



9* Using Hit masks to Search Hierarchical Data 

As noted in the previous section, the hierarchical data model presents an interesting 
challenge - how to perform a search across entities at different hierarchical levels. The 
challenge arises because the hierarchical entities can be stored without inherited properties. An 
entity's inherited property is one that is defined on one of the entity's parents. For example, a 
chapter entity has a property, name. Each section within the chapter inherits the name property. 
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The name property is stored only on the chapter entity. Due to storage space, query performance 
and load performance considerations, inherited properties are not stored on an entity. 
Suppose a user wishes to perform the following query. 

Find all the books written by Dr. Seuss containing a chapter title with the word "Cat" 

In this query the property, author, defined on the entity, book, is searched for an author, Dr. Seuss 
and a chapter entity's property, title, is searched for the word, Cat. At first glance, this query 
appears to be a normal parametric query with two Boolean conditions. However, after closer 
inspection this is not a normal parametric query because it involves attributes of entities of 
different hierarchical levels: Book and Chapter. For this reason, this query is broken into two 
parametric queries: 

1. Find those entities of type = book and author = Dr. Seuss. 

2. Find those entities of type = chapter and title containing the word "Cat" 

The two sets of results never intersect because they are for different entities. The first results 
consist of book entity matches, the second set of chapter entity matches. These entity matches 
for the query criteria are also called "hits". 

Since inherited properties are not stored within the datastore, the book entity and chapter 
entity do not share common properties. Therefore, if an intersection is performed on these two 
sets of hits, the resultant set is empty. However, the goal of the search was to find all the book 
entities satisfying both criteria. 

One of the keys to defining meaningful results for the two parametric queries described 
above is to define the entity expected in the results. For example, in the above query the book 
entity is returned. However, it may be desired to return the chapter entity to the user. Therefore, 
any proposed query interface must provide the application developer freedom to decide which 
entity is to be returned. 
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Examples of entities defined in the results include: 

■ Only the section hits that belong to chapter hits which in turn belong to book hits; 

■ The appropriate book hits, chapter hits and section hits, where all the section hits belong to 
one chapter hit which in turn belongs to one book hit; 

5 ■ A section hit that does not belong to any chapter hit; 

■ A chapter hit that does not belong to any book hit; or 

■ A book hit that does not contain any chapter hit or section hit. 



^4? 



s r 'F: 



This is really a question about how to perform logical AND/OR Boolean operators on the 
10 result sets across different entity levels. The answer varies depending on the needs of a particular 
application as well as the expectation from its target users. In order to accommodate different 
application needs and user expectations, the individual results are combined via a set union, and 
for each entity a bit stream known as a hit mask is included to indicate which query a specific 
entity satisfied. The application's query code does not need to know the specifics regarding the 
315 current hierarchical data model. The digital library API 16 provides the application's program 
;* :; with enough information so the application program can perform filtering on the hits and decide 
s *f the entity to be returned to the user. 

iss;;: 

f|j Referring to Fig. 6, the webserver 26 makes calls to an application program interface 

layer, which here comprises both application layer 28 and API layer 30. The application 

^20 program, in turn, makes calls to the digital library API 16. The digital library API 16 makes the 
program calls to the datastore to assemble the results into hit mask, unique identifier tuples. This 
hitmask-unique identifier tuple is described later. At present it is important to understand the 
breakdown of the logic with respect to program interfaces. 

As illustrated above, when there is a complex query against multiple entity types, it is 
25 necessary to break the query down into multiple queries, each against a specific entity type. Two 
queries can be performed on the same entity type, but the same query is never performed on 
multiple entity types. 

In the present embodiment, the queries are executed as parametric and free text queries, 
yielding multiple sets of results. The multiple sets are combined via a union into one single set. 
30 During the process of union, a hit mask is built for each entity processed to indicate which one or 
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more individual queries returned the entity. The total number of individual queries determines 
the total length of the hit mask, and each bit has an initial value of zero. 

If a hit is merged in from the results for individual query 1, the first bit is set to 1. If a hit 
is merged in from the results for individual query 2, the second bit is set to 1, and so on. If an 
entity is a hit in both results set, both bits will be set to 1 . For any particular entity type in the 
combined set, its corresponding hit mask will contain one or more Fs. 

A hit mask is a bit string, a string of 0 ? s and Fs. Each bit corresponds to one 
parametric/free text query against a specific entity. A bit with a value 1 indicates the entity is a 
hit for the corresponding query; a bit with a value 0 indicates the entity is not a hit for the query. 
In the previous example, the query was broken into two parametric queries: one query on the 
book entity and one on the chapter entity. The example below shows the queries being 
performed against the contents of a typical datastore with the resultant hit masks. 



Queries 

1 . Find those entities of type = book and author = Dr. Seuss. 

2. Find those entities of type = chapter and title containing the word "Cat" 



Datastore Contents 


Hit mask* 


Book - Cat In The Hat by Dr. Seuss 


10 


0123456789.00.00 




o Chapter - Tee Totaler 


00 


0123456789.01.00 




o Chapter - The Fat Cat 


01 


0123456789.02.00 




Book - Horton The Elephant by Dr. Seuss 


10 


0291123421.00.00 




o Chapter - 1 Meant What I Said 


00 


0291123421.01.00 




Book - Cannery Row by John Steinbeck 


00 



Unique 
Identifier 



0412856798.00.00 

* The left bit corresponds to query one and the right bit corresponds to query two 

The above hit masks represent the hits for the queries. For example, The Cat In The Hat 
book with author Dr. Seuss matches the first query, but not the second one. Therefore, the hit 
mask is 10. Likewise, the chapter, The Fat Cat, is a hit for the second query so its hit mask is 01. 

Each query result is returned with its unique sequence identifier, previously described. 
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Recall that the sequence identifier is in the form: XXXXXXXXXX.CC.SS., where the 10 X's are 
the ISBN number, CC is the chapter number and SS is the section number. Once the hit masks 
and unique identifiers have been returned to the application program from the API layer 30, the 
application program can perform a logical AND/OR upon them. The application program 
5 provides the freedom to choose how unions and intersections are performed on the hits. 

The manner in which the application program of the present embodiment uses the hit 
masks and unique identifiers to filter the combined set of results will now be described. In the 
example above involving The Fat Cat, a problem exists in performing a union or intersection on 
The Fat Cat chapter entity hit mask, 01 with the Cat In The Hat book entity hit mask 10. Since 
10 The Fat Cat is contained within the Cat In The Hat, it satisfies both queries. Its hit mask needs to 
inherit the properties of the hit mask of the parent book to have a value of 1 1, indicating that it 
%jg satisfies both queries. The Cat In The Hat contains a 1 for the first query is a book container 

jSSi 

entity. Thus bit representing query one in the hit masks of all entities that the book contains (i.e. 
J|{ children, grand children, etc.) should be set to "1". Recall, however, that inherited properties 
0 1 5 such as parenthood are not retained with the datastore. 

: £ 

~ The solution to this dilemma lies in the unique identifier's format, which contains 

information about the hierarchical relationship of these entities. The Cat In The Hat's unique 
fU identifier is 0123456789.00.00 and the unique identifier of The Fat Cat is 0123456789.02.00. By 
q matching all unique identifiers with the same XXXXXXXXXX values, these identifiers allow us 
^ ! 20 to determine that The Fat Cat is contained by The Cat In The Hat. Since The Cat In The Hat 

contained a 1 for the first query, The Fat Cat's and Tee Totaler hit mask are changed to values of 
1 1 and 10, respectively. The following two phase process can be employed to obtain this result. 

Phase One - Hashtable Creation. The first step is to build a nested hash table that 
reflects the relationships of the entity hits in terms of the hierarchical data model. This can be 
25 done based on the unique identifiers. For each entity in the results, place the hit mask in a nested 
hash table at the appropriate entity. The keys of the hashtable are unique identifiers and the 
values are the hit masks bit streams. The diagram below shows the hashtables for the previous 
example. Once all the entities are processed, a nested hashtable exists reflecting the containment 
relationships. 

30 
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<Book Hashtablo 

Key 1: 10 Value: <Chapter Hashtable> 

Key 1: 00 Value: <SectionHashtable>... 
Key 2: 01 Value: <SectionHashtable>... 

5 <Book Hashtable> 

Key 1: 10 Value: <Chapter Hashtable> 

Key 1 : 00 Value: <Section Hashtablex . . 

<Book Hashtable> 

Key 1: 00 Value: <Chapter Hashtablex.. 

10 

Phase Two - Filtering. Once the hashtable is constructed, application program uses 
simple Boolean logic to obtain the hits. In the example, the desired results are all books by Dr. 
Seuss containing a chapter with the word Cat in the title. The filtering process starts on the book 
^.15 hashtable keys looking for a hit mask of 10. There are two books:. The Cat In the Hat, Horton 
*iP The Elephant) satisfying this criteria. The filtering process continues on the chapter hashtables 
m within these two books looking for hit masks of 01. The chapter, The Fat Cat, is located and 
J therefore is a hit for the complex query. Thus the containing book's, The Cat In The Hat, unique 
O identifier is returned. 

ii rli 

« 20 One of the advantages of the above two phase process is the code reuse and flexibility of 

j*: the returned entity. For example, the chapter, The Fat Cat, can be returned instead of the 
[U contained book without changing the phase one process. The only changes required are to alter 
Q the end processing on phase two. This is important since the returned entity varies from website 
to website. 

25 Search and hit mask generation procedures may be performed by software and may be 

tangibly embodied in a program product such as a CDrom or floppy disk, or stored in a 
computer system's nonvolatile memory. The use of hit masks for hierarchical document models 
provides many benefits. It is an efficient solution from a performance perspective because a 
minimum amount of information is transmitted back to the Web application. In addition, the 

30 solution is configurable in that it permits decision making of OR, AND and returned entities at 
the application layer 28. The query code is generic and flexible enough for multiple hierarchical 
data models. Performance is improved due to the easy Boolean logic performed on hit masks to 
find results. And finally, this approach minimizes space requirements for storing the hierarchical 
data model by avoiding storage of inherited properties 
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10* System Administrator Interface and Functions 

A system administrator can perform a number of functions through interface 24. An interface 
application first displays a main menu of options. In the present embodiment, three options 251, 252, 
5 253 are available to the user as hypertext links, as shown in Fig. 22A. Two of these pertain to the 
content selection path of Fig. 6: "Approve User" 251 and "Make Products Available" 252. 

The "Approve User" option 25 1 is used to promote new users from a "guest" status to an 
"approved user" status. When a user registers with the compilation system for the first time, he is 
assigned a guest status that authorizes him to create and submit CBO's. However, a compilation will 
10 not be approved for publishing unless the user has been approved. The system administrator basis his 
f approval decision on a verification of the user as a valid user (e.g., as a bona fide university professor). 

The approval function is invoked by clicking on link 25 1 . This action invokes an approval 

sips 

Eg function in application layer 28 which issues a query to the User Table for all users whose "security" 

attribute is equal to "G"? The results are returned to the approval function, parsed, and displayed as a 
0 15 list of users and corresponding user information. An example of an approval list is shown in Fig. 22B. 

Attributes displayed include the user name 256, university 257, department 258 and registration date 
2 259. Beside each username is a select box. By checking desired ones of the select boxes and clicking 
I ^ on the "Commit" button 255, the system administrator causes the "security" attribute value in the User 
Q Table to be changed to "F' (full access authority). The user is then returned to the main menu of Fig. 
y 20 22A. 

The "Make Products Available" option 252 is used to designate prepublished content products 
available for use in compilations of content. When prepublished content is first loaded into digital 
library 20, it is assigned a default status of "unavailable". This status is indicated by a "U" value in the 
"Status" column defined by the Products Index Class. Unavailable products may not be included in a 
25 bookshelf, and may not be returned from a search content request. A prepubished product does not 
become accessible until this attribute value is changed to "A", indicating that the product is now 
available for use. 

The system administrator initiates the "Make Products Available" function by clicking on link 
25 1 . This action causes a corresponding function in application layer 28 to display a list of products. In 
30 the present embodiment, this list appears as shown in Fig. 22C. It includes the book titles 260, and a 
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corresponding select box 261 beside each title. By checking desired products for promotion and 
clicking on "Commit" button 262, the user invokes the EProductSetState function and causes the 
product state to change from "U" to "A". The EProductSetState function is described below. 

5 EProductSetState 

Description 

When a prepubiished content product is first loaded into the system, the status is Unavailable for 
each entity in the product. The web site's administrator then sets the status to Available when he 
10 feels confident that all the pieces of the product have been loaded completely and correctly. The 

administrative application uses this routine to set the product status to available. 

Functional Processing 

i-iO I. Locate the existing copyright product entity that matches the sequence id parameter. 

h;Q 15 II. Locate the entities within that product entity by finding entities that are prefixed by the same sequence id. 

s III. For each entity found, set its status to the product status passed ("0" = Unpublished, "1 " = Published). 

Parameter List 



Inputs 


DSUsername 
DSPassword 
DSHostname 
Sequence id 
Product state 


Outputs 




Errors 


RC_NOT_FOUND 
RC_SYSTEM_ERROR 



20 

Index Classes 
• Product 

25 11. Submitting a CBO 

When the user has completed creating his custom book outline, he may submit it to the 
approval process by clicking on "Submit Request" button 180. This action invokes the 
ECBOSubmit and ERequestStatus procedures described below. 
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ECBOSubmit 



Description 



This routine submits the CBO when the user has completed creating his book and is ready to 
submit the order. This function performs some policy checks against that CBO and then creates a 
Request in the Request entity. It then makes the CBO available to the System Administration 
application for approval. The Custom Publishing subsystem then retrieves the Request when it 
processes. 



Functional Processing 

I. Verify that the CBO id passed is valid. 

II. Check the volume virtual page count limit in the CBO. If total virtual page count for each of the volumes go 
above or below the volume virtual page count limit, the volumes are reorganized. 

III. Check the CBO minimum virtual page count limit in the CBO. Verify that the total CBO virtual page count 
exceeds the CBO minimum virtual page count limit. 

IV. Verify that the CBO contains at least one section from one prepublished content product. 

V. Create a new order request for this CBO with all the information passed in. 

VI. Update the new Order Request with information for all the volumes in the CBO. 

VII. Mark the CBO status as Submitted ("2") with a call to ERequestStatus. 
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Parameter List 



Inputs 


DSUsername 




DSPassword 




DSHostname 




Userid 




Author title 




Author first name 




Author last name 




University 




Department 




CBO id 




Course name 




Course number 




Course registration number 




Estimated enrollment 




Term 




Term start date 




Bookstore date 




Desk copy count 




Supplement count 




Alternate address 1 




Alternate address 2 




Alternate address 3 




Alternate city 




Alternate state / province 




Alternate zip code / postal 




Alternate country 


Outputs 


New request order number 


Errors 


RC_NOT_FOUND 




RC_INVALID_CBO 




RC_UNDER_MINIMUM 




RC_NO_PRE_PUB_SECTION 




RC_SYSTEM_ERROR 







Index Classes 

• CustomBookOutline 

• Request 

• RequestAux 
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E Req uestU pdateStatus 

Description 

When a user's Order Request moves through the workflow from creation to approval/return/ 
rejection to completion, the approval status of the Order Request is changed in the digital library 
Request Index class to reflect its progress. 

Functional Processing 

I. Search for one Order Request with the Request id passed. 

II. Verify that there is exactly one match for the Request id. 

III. Update the status of the Order Request found. 

Parameter List 



Inputs 


DSUsername 
DSPassword 
DSHostname 
Request id 
Request status 


Outputs 




Errors 


RC_SYSTEM_ERROR 



Index Classes 
• E_Request 

The following support function is also provided in application layer 28 and pertains to the 
creation of a CBO. 

EGetProgramAttributes 

Description 

This retrieves attributes for a particular program from the Program entity. 
Functional Processing 

I. Search for any program record with the program id passed. 

II. Verify that there is exactly one match for the program id. 
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III. Retrieve all the attributes from the program record. 

IV. Export all the attributes as one big string into the results buffer. 

Parameter List 



5 



Inputs 


DSUsername 
DSPassword 
DSHostname 
Program id 


Outputs 


Results buffer 


Errors 


RC_SYSTEM_ERROR 



Index Classes 

• Program 
% • ProgramAux 

fio 

. ifrs 

m C. Request Processing Path 

;« The content management system of the present invention includes a backend path for 

^ submitting a compilation for approval and publication, as shown in Fig. 23. This path begins 
j^l 5 with a user's submission of a CBO for approval previously described. 

As shown in Fig. 23, the request processing path includes a system administration facility 
O for retrieving submitted CBO from the digital library content server(s) 18 through API layer 30, 
and placing the pending CBO content into an unformatted Product Structure File. The system 
administration facility 24 presents the PSF into a format easily viewed by the editor. If the editor 
20 approves the CBO, the state of the CBO is updated to approved. 

A background application is run periodically to process all approved CBOs which have not 
been published. This background application, request generator 32, retrieves a list of all approved 
CBOs awaiting publishing. If the CBO is approved and the submitter (author) has been approved 
(a.k.a. granted full access) then the CBO is extracted from the data store 20 via the API layer 30 in the 
25 PSF format. An ISBN number is assigned to the CBO and all related volumes. This ISBN number is 
stored within the datastore in the Request (i.e. CBOISBN) and RequestAux (i.e. VolumelSBN) entity. 
This PSF format is stored into one or more files on the local file system and the CBO status is updated 
to published . Due to the newly assigned ISBN values the resultant PSF sequence ids 
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(XXXXXXXXX.CC.SS) within the PSF are not correct. These files still retain the original pre-pub 
book ISBN values. The formatter is responsible for updating these sequence ids in the following step. 

Another background application, formatter 34, is run periodically to process the PSF files 
residing on the request generator's 32 file system. Therefore, this application must have access to 32's 
file system. This can be achieved via NFS clients or the like. The formatter scans the files within the 
request generator file system. Upon locating a PSF file, it is parsed and the sequence ids are retained. 
The formatter then queries the data store 20 via the API layer 30 to extract the real content for this 
sequence id (a pre-published book or UPM). Upon retrieving the real content, the sequence id is altered 
to reflect the CBO's ISBN number, chapter and section numbers. The new sequence id is stored in the 
PSF file and the content is stored into files residing in the same directory. 

Product Generator 36 receives the input CBO files and reformats them into a desired 
publishing format. In the present example, the desired publishing format is aFramemaker 
format, although conversion to other formats may be supported in addition to or in lieu of 
Framemaker format conversion. The product generator of the present embodiment is a 4GL 
application. 

The resultant CBO Framemaker files are now forwarded to publishing system 38, a 
printing and distribution system that uses a Framemaker file set in printing and distributing text 
books. 

CBO Workflow 

Status Values. The following table lists the acceptable CBO Status and the Request Status 
values that are assigned to the "ApprovalStatus" columns in the CBO and Request index classes of 
digital library 20. 




0 Active Available to the instructor to revise 

1 Submitted Submitted by the instructor 

2 Approved Approved by editorial and ready for publishing 

3 Rejected Rejected by editorial and no longer able to be revised 
4 Completed Completed custom publishing 
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CBO Workflow States. Fig. 24 is a CBO status workflow state diagram illustrating the 
various states of the Request approval status, CBO approval status, and user access level (the value 
stored in the "Security" column of the User Table) that affect the workflow of a CBO through the 
described process. It shows the values of each variable at different states and the action that changes the 
5 state. 

1. System Administrator Approval Process 

As previously discussed, Custom Book Outlines are created through the web user interface 22 
from pre-published content and user-provided material, then submitted for approval. During the submit 
10 process, the user enters additional information or override information through the web pages. When 
he has entered all the required information, the request is stored in digital library 20 as a Request entity, 
=| and the CBO status is changed from "1" (Working or Active state) to "2" (Submitted state) in the CBO 
j|J Index class (Approval Status attribute). 

] Jf An editor reviews pending requests through the system administrator web interface 24 by 

HI 5 selecting the "Edit/Review/Approve CBO's" option 253 of Fig. 22A. This action invokes an approval 
f s process that issues a call to the ECBOIist function. The ECBO List function, described below, issues a 
query to digital library 20 for all CBO's with a status equal to "2". 

ECBOList 

^920 Description 

This function returns a list of all CBO's that are stored in the digital library for a specified Program 
that are in a specified status in the CBO workflow (e.g., all "Active" CBO's in the Freshman 
Engineering program). The status and program are passed as parameters. This function is used 
25 by the system administration facility to obtain all CBO's in a submitted state. It is also used by the 

request generator 32 to obtain all CBO's in an approved state. 

1 . Verify that program id passed is valid. 

2. Search for CBO's with the program id and the CBO status specified. 

30 3. For each CBO found, send back the CBO id, the Type, Title, and Creator ID. 
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Parameter List 



Inputs 


DSUsername 




D5>rassword 




DSHostname 




Program id 




CBO state 


Outputs 


CBO id list 


Errors 


RC_INVALID_OR_MISSING_INPUT 




RC_SYSTEM_ERROR 



5 Index Classes 

• CustomBookOutline 
m • Program 

1 0 The CBO' s returned by the ECBQList function are parsed by the approval process and 

|f displayed on a Submitted Custom Books screen. An example of an approval screen is shown in Fig. 
Q 22D. It includes a list of CBO titles 27 1 and owners 272, where the titles provide hypertext links to the 
" r CBO's themselves. Clicking on a CBO title invokes an approval process that calls the ECBOGet and 
;j*f ECBOGetUPM functions. The returned CBO and user content are parsed and displayed to the editor 
HJ 15 on an approval screen. 

ih A portion of the approval screen of the present embodiment is shown in Fig. 22E. It presents 

^ the editor with the custom book outline that includes the content associated with each UPM element 
For example, in the CBO shown Chapter 1 contains a first UPM element 288 (UPM refers specifically 
to "Instructor Provided Text"). The UPM title ("New UPM Section") is displayed at 284 as a hypertext 
20 link. It is also provided in an editable title field 287, and the UPM's corresponding content ("UPM 

Content") is contained in an editable field 285. Thus the editor is able to review and edit the content for 
correctness and appropriateness. He may also insert editorial suggestions and comments pertaining to 
this UPM into field 286. In addition, prepublished section titles are provided as hypertext link to their 
corresponding content. If desired, the editor may view the content of any section by clicking on its link 
25 When the editor has completed his review, he may approve, reject, or return the CBO with 

editorial comments by clicking on one of the buttons 281, 282, 283 at the top of the screen. In the event 
of rejection, the CBO approval status stored in the digital library CBO Index class is changed to "3" by 
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the ECBOSetState function. The CBO will now be displayed on the user's Welcome screen with a 
"Reject" status, and the owner will net be permitted to manipulate the book any further. A CBO return 
causes ECBOSetState to set the CBO approval status value back to 'T\ and the editorial comments to 
be sent to the owner via e-mail. In this case the owner is allowed to alter the book in an attempt to 
5 correct situations identified by the editor. Lastly, CBO approval status to "2". All CBO's with an 
approved status will be retrieved by the Request Generator 32. 

2 Request Generator 

Description. The Request Generator 32 application polls digital library 20 to determine which 
10 submitted requests are in an approved state but still are not extracted. It will only publish those CBO's 
having an approval status of "2" and an owner status of "F" (i.e. Full Access). An administrator 
h|j invokes the request generator 32 on the command line using the ra.pl executable. The ra.pl executable 
X calls the rasetup.pl script which queries the digital library 20 for CBO's having a CBO Status equal to 
Jg "2" and an owner's Access Level equal to "F\ For these CBO's, it sets the Request approval status to 
p 1 5 "2" in the Request Index class Upon completion of rasetup.pl, the ra.pl procedure is invoked and places 
7 all CBO's having a Request Status equal to "2" into the /request/esource/order directory via the 
j=«f ERequestlist function described below. This directory is later accessed by an EPGSetup NT program 
f y invoked by formatter 34. 

j ^ The Request Generator 32 creates an Intermediate Product Specification File (INT), a Cover 

W20 Page Customization file (CPC), and a Title Page Customization file (TPC). Request Generator 32 also 
invokes a procedure called rgisbn to generate an ISBN for each custom volume within a custom book, 
and one for the CBO itself. For example, a custom publication with two volumes will generate three 
total ISBNs. The new ISBN(s) are stored in digital library 20 by the ERequestlSBN procedure, 
described below. 

25 

Inputs 

• Digital Library Requests 

• Digital Library attribute data (Program, Product) 
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Outputs 

• Intermediate Product Specification File: /request/esource/order/<short-isbn>/<short-isbn>.INT 

• Cover Page Customization file: /request/esource/order/<short-isbn>/<short-isbn>.CPC 

• Title Page Customization file: /request/esource/order/<short-isbn>/<short-isbn>.TPC 

• Request Notification message 

• New Title Setup message 

• Request Order message 

• Desk Copy Order message 

• Desk Copy Approval message 

Functional Processing 

ra.pl 

1. lnvokerasetup.pl 

a) Determine which Requests have been submitted and approved via ERequestList, then 
retrieve the CBO definition for each of these Requests. 

b) Find Requests with a CBO Status equal to "2" for users with Access Level equal to "F" 

c) Change the Request Status to "2" for those found 

2. Invoke rgPR.pl 

a) Use a subset of the CBO attributes to build the INT file 

b) Use data in the CBO to build the CPC file 

c) Use data in the CBO to build the TPC file 

3. Assign an ISBN to the custom publication and store it into digital library index classes via a call 
to ERequestlSBN 

a) Read next ISBN number from the ISBN file 

b) Remove ISBN number from the ISBN file 

c) Assign an ISBN to each volume in the CBO 

d) For multi-volume CBO's, assign an ISBN to the Value Pack containing all volumes 
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4. Extract UPM from Digital Library 

a) Retrieve UPM 

b) Modify UPM to treat the owner name as an author attribute on the <UPM> markup tag. 

5. Perform royalty calculations 

a) Sum the page counts for each pre-published ISBN included in the CBO to get a Total 
ISBN Page Count 

b) Sum the page counts for UPM, Preface, TOC, and Index to get the Total ISBN Page 
Count for the generated ISBN 

c) Calculate a percentage for each ISBN, both pre-published and custom, by dividing the 
Total ISBN Page Count by the CBO Page Count 

6. Invoke rggen.pl 

a) Format and send the Request Notification message 

b) Format and send the New Title Setup message 

c) Format and send the Request Order Info message 

d) Format and send the Desk Copy Order message 

e) Format and send the Desk Copy Approval message 

7. Change the Request Status and the CBO Status to "4" (completed) 

ERequestList 

Description 

This routine returns a list of the Order Requests with a particular Request Status. In most cases, 
this will be looking for a Request Status of "2", which is approved. 

Functional Processing 

I. Search for Order Requests with the Request status passed. 

II. For each Order Request found, return the Request id, User id, CBO id, Book store date, and estimated 
enrollment. 
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Parameter List 



Inputs 


DSUsername 
DSPassword 
DSHostname 
Request status 


Outputs 


Order Request list 


Errors 


RC_SYSTEM_ERROR 



Index Classes 
• Request 

ERequestAssignlSBN 

Description 

When an Order Request is processed by the Request Generator, a new ISBN is generated and 
assigned to the Order Request. This routine stores the ISBNs into the Request and RequestAux 
Index Classes. 

Functional Processing 

I. Search for one Order Request with the Request id passed. 

II. Verify that there is exactly one match for the Request id. 

III. If the volume id passed is null, there is only one volume involved. Update the package ISBN with the newly 
assigned ISBN value. 

IV. If the volume id passed is not null, locate the volume specified and update its ISBN with the newly assigned 
ISBN value. 



Parameter List 



Inputs 


DSUsername 
DSPassword 
DSHostname 
Request id 
Volume id 
Assigned ISBN 


Outputs 




Errors 


RC_SYSTEM_ERROR 
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Index Classes 

• Request 

• RequestAux 

Other Back-End Support Functions 

Other API layer 30 functions invoked by the Request Generator 32 are described below. 

ERequestGet 

Description 

This returns the attributes for a specific Order Request from the Request entity. 
Functional Processing 

L Search for one Order Request with the Request id passed. 

II. Verify that there is exactly one match for the Request id. 

III. Retrieve all the attributes for the Order Request. See Attribute Format below for more details. 
Parameter List 



Inputs 


DSUsername 
DSPassword 
DSHostname 
Request id 


Outputs 


Order Request description 


Errors 


RC_SYSTEM_ERROR 



Index Classes 

• Request 

• RequestAux 
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Attribute Format 

!<attmame>:<valuexcarriage retum> 
Example: 



!ApprovaiStatus:1 

!UserlD:testprof 

!CBOID:XT3QH7YOI$HFBS1 6 

!AuthorTitle:ms 

!AuthorFirstName:jan 

!Authorl_astName:murray 

!University:purdue 

IDepartmentcs 

!CourseName:intro 

!CourseNumber:101 

!CourseRegNumbercs1 01 

!Term:Fail1999 

!TermStartDate:1 9990912 

!BookStoreDate:1 999081 5 

!AltAddress1:41 Waldron 

!AitAddress2:MS717A 

!AltAddress3:-alt-3- 

!AltCity:W. Lafayette 

!AltState:IN 

!AltZip:4761 0-5555 

!AltCountry:537020568 

!EstEnrollment:175 

!DeskCopyCount:4 

!SuppCount:4 

! ReceiptDate:1 99903271 40527 

ITotal Price: 

ICPDPrice: 

IValuePackPrice: 

ICPDISBN: 

IPackagelSBN: 0130296686 
IVolume 

!VolumeiSBN:013029666X 
!VolumelD:V1 

'Volume 

!VolumelSBN:0130296678 
!VolumelD:V2 

Note: This CBO is a Multi-Volume CBO containing 2 volumes. Each volume has an ISBN (i.e. VolumelSBN) and the 
entire CBO has an ISBN (i.e. PackagelSBN) 



ERequestExistsForCBO 

Description 

Check to see whether there is any Request Order corresponding to the CBO id of interest. 
Functional Processing 

I. Search for any Order Request with the CBO id passed. 

II. Return 1 if there is at least one match. Return 0 if there is no match. 
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Parameter List 



Inputs 


DSUsemame 
DSPassword 
DSHostname 
CBOid 


Outputs 


Found? (0,1) 


Errors 


RC.SYSTEMJERROR 



5 Index Classes 
• Request 



±10 



01 



5 f-'i 



ul5 



ERequestGetForCBO 

Description 

Ids of all the Order Requests that correspond to the CBO id are returned in a list. 
Functional Processing 

I. Search for any Order Request with the CBO id passed. 

II. For each Order Request found, save its Request id. 

Parameter List 



Inputs 


DSUsemame 
DSPassword 
DSHostname 
CBOid 


Outputs 


Request id list 


Errors 


RC_SYSTEM_ERROR 



Index Classes 
20 • Request 
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Intermediate Product Specification File (INT). For each volume, an Intermediate Product 
Specification File (INT) is generated by Request Generator 32. The file is named <short-isbn>.INT and 
is written to an Order Directory in a subdirectory named <short-ISBN>. Request Generator creates this 
file by retrieving the Request and the CBO from digital library 20. A sample file format is shown 
below. 



PRODUCT:<newisbn>.00.00.00 

FRONT_AND_BACK_ELEMENT: 

FRONTAND_BACK_ELEMENT: 

CHAPTER.C: 

SECTION:<oldisbn>.nn.nn.nn 
UPM_SECT10N:mm 
!UPM_SECTION_AUTHOR:Author Here 
SECTION:<oldisbn>.nn.nn.nn 

CHAPTER.C: 

FRONT_BACK_ELEMENT:<oidisbn>.nn.nn.nn 
SECTION:<o!disbn>.nn.nn.nn 
SECTION:<oldisbn>.nn.nn.nn 
FRONT_BACK_ELEMENT:<oldisbn>.nn.nn.nn 

CHAPTER.C: 

SECTION:<oldisbn>.nn.nn.nn 
SECTION:<oIdisbn>.nn.nn.nn 
SECTION :<oidisbn>.nn.nn.nn 

FRONT_AND_BACK_ELEMENT: 



In the INT file, <newisbn> is the ISBN for the volume, <oldisbn> is the ISBN of the book the section 
originally came from, nn.nn.nn is the chapter and section number of the section as it appeared in the 
original book, and mm is sequence id. 

UPM Content (seqid). For each UPM content element, the formatter 34 PG Setup procedure 
calls ERetrieve to get the content and then writes a content file. This file is named <sequence id>.SGM 
is written to the PG Done Directory in a subdirectory named <short-ISBN>. The UPM is be modified to 
treat the owner name marked within the INT file as an author attribute on the UPM markup. 
For example: 



Original UPM 

<UPM> 

<TITLE>Digital Fundamentals</TITLE> 
<text>Text UPM follows here</text></UPM> 
<UPM> 

<TITLE>Digital Fundamentals ll</TITLE> 

<code> 

ifa=b 

then do this 
else do that 
</code> 
</UPM> 
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Modified UPM 



<UPM instructor="Joe Professor"' 

<TITLE>Digital Fundamentals<ATITLE> 

<text>Text UPM follows here</text></UPM> 

<UPM instructor-"Joe Professor^ 

<TlTLE>Digital Fundamentals li</TITLE> 

<code>if a=b 

then do this 

else do that 

</code> 

</UPM> 

3. Formatter 

Description. The Formatter 34 process is defined by the EPGsetup procedure. EPGsetup 
retrieves CBO content from digital library 20 and places it into a file system directory for use by the 
Product Generator 32. EPGsetup then reads the INT file created by Request Generator 32 in the 
previous step and calls ERetrieve to get the content elements from the digital library 20. The content, 
POF, CPC, and TPC files are placed in the PG directory, in a subdirectory named <short-isbn>, which 
is based on the ISBN of the custom publication. 

Inputs 

• Intermediate Product Specification File: e:\order\<short-isbn>\<short-isbn>.INT 

• Cover Page Customization File: e:\order\<short-isbn>\<short-isbn>.CPC 

• Title Page Customization File: e:\order\><short-isbn>\<short-isbn>.TPC 

• UPM Content: e:\orde A<short-isbn>\<upm-id>. UPM 
Outputs 

• Product Specification File: e:\pg\<short-isbn>\<short-isbn>.PSF 

• Product Override File: e:\pg\<short-isbn>\<short-isbn>. POF 

• Cover Page Customization File: e:\pg\<short-isbn>\<short-isbn>.CPC 

• Title Page Customization File: e:\pg\<short-isbn>\<short-isbn>.TPC 

• SGML Content Files: e:\pg\<short-isbn>\<sequence-id>.SGM 

• SGML Attribute Files: e:\pg\<short-isbn>\<sequence-id>.ATR 
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• Associated Component Content Files: e:\pg\<short-isbn>\HiRes\<sequence-id>.<extension> 
(TIF or EPS) 

• UPM Content: e:\pg\<short-isbn>\<sequence-id>.UPM 
Functional Processing 

EPGsetup 

1 . Poll the e:\order directory to find any <short-isbn> directories created 

2. Process each INT file found in those <short-isbn> directories 

3. Verify that each non-empty chapter begins with a FRONT_AND_BACK„ELEMENT. If it does 
not, then add one in place. 

4. Convert the INT into a PSF file 

5. Invoke ERetrieve to get the content, including SGML, ATR files, and Associated Components 
for each line item in the PSF 

6. Make UPM located at the beginning of the book into its own chapter 

7. Make UPM located at the end of the book into its own chapter 

8. Make UPM located immediately before any chapter into its own chapter 

9. Renumber the PSF file to be well-formed 

a) Uses the custom book ISBN number as the high-level digits 

b) Sequence the sequence ids into the standard hierarchical structure 

c) Rename the pre-published content file names to match the new sequence numbers 

d) Rename the UPM content file names to match the new sequence numbers 

1 0. Move files into the e:\pg\<short-isbn> directory 

1 1 . Execute Product Generator 



Product Specification File (PSF). For each INT file, a Product Specification File (PSF) is 
generated by EPGsetup. This file is named <short-ISBN>.INT and written to the PG Directory in a 
subdirectory named <short-ISBN>. The PSF is based upon the INT file from the Request Generator 
32. The PSF has the following format: 
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PRODUCT:newisbn.00.00.00 

FRONT_BACK_ELEMENT:newisbn.cc.ss.OO 

FRONT_BACK_ELEMENT:newisbn.cc.ss.OO 

CHAPTER.Gnewisbn.cc.OO.OO 

FRONT_BACK_ELEMENT:newisbn.cc.ss.OO 
SECTiON:newisbn.cc.ss.OO 
UPM_SECTION:newisbn.cc.ss.OO 
SECTION:newisbn.cc.ss.OO 

CHAPTER.C:newisbn.cc.00.00 

FRONT_BACK_ELEMENT:newisbn.cc.ss.OO 
SECTiON:newisbn.cc.ss.OO 
SECTiON:newisbn.cc.ss.OO 
FRONT_BACK_ELEMENT:newisbn.cc.ss.OO 

CHAPTER.C:newisbn.cc.OO.OO 

FRONT_BACK„ELEMENT:newisbn.cc.ss.OO 
SECTION:newisbn.cc.ss.OO 
SECTJONrnewisbn.cc.ss.OO 

SECTION:newisbn.cc.ss.OO 

FRONT_BACK_ELEMENT:newisbn.cc.ss.OO 



where newisbn is the ISBN for the volume and cc is the chapter number, and ss is the section number. 
Chapter numbers should be sequential within a volume and should start with 1 . Section numbers 
should be sequential within a chapter and should start with 1 

Product Override File (POF). During creation of a custom book, a user may change the title 
of a pre-published chapter. If a volume contains such a change, PG Setup will generate a Product 
Override File (POF) for each volume to specify any overridden chapter titles. This file is named 
<short-ISBN>.POF and is written to the PG Directory in a subdirectory named <short-ISBN>. The 
POF references a line number of a chapter tag within the final PSF sent to Product Generator 36. The 
file has the following format: 



N:TITLE:xxxxxxxxx 
N:TITLE:xxxxxxxxx 



where N is the line number of the final PSF that contains the chapter tag to override and xxxxxxxxx is 
the new title for that chapter. For example, suppose the owner of the PSF below has changed the title of 
Chapter 2 to "My Sample Chapter". The POF for achieving this result is shown below. 
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Product Specification File (PSF) 



1 PRODUCT:0123456789.00.00.00 



2 FRONT J3ACK_ELEMENT: 01 23456789.01 .01 .00 

3 FRONT_BACK_ELEMENT: 01 23456789.01 .02.00 
5 4 CHAPTER.C:01 23456789.02.00.00 

5 FRONT_BACK_ELEMENT: 01 23456789.02.01 .00 

6 SECTION: 0123456789.02.02.00 

7 UPM_SECTION: 01 23456789.02.03.00 

8 SECTION: 01 23456789.02.04.00 
10 9 CHAPTER.C:01 23456789.03.00.00 

10 FRONT_BACK_ELEMENT: 0123456789.03.01.00 

1 1 SECTION: 0123456789.03.02.00 

1 2 SECTION: 01 23456789.03.03.00 

13 FRONT_BACK_ELEMENT: 0123456789.03.04.00 
15 14 CHAPTER.C:01 23456789.04.00.00 

15 FRONT_BACK_ELEMENT: 0123456789.04.01 .00 

1 6 SECTION: 01 23456789.04.02.00 

17 SECTION: 0123456789.04.03.00 

1 8 SECTION: 01 23456789.04.04.00 

20 19 FRONT_BACK_ELEMENT: 0123456789.04.05.00 

20 FRONT_BACK_ELEMENT: 01 23456789.05.00.00 



jg Product Override File (POF) 

'% 9:TITLE:My Sample Chapter 

Si 25 

t$ Pre-published Content. For each content element of type SECTION or 

^ FRONT_AND_BACK_ELEMENT, PG Setup calls eRetrieve to get the content and writes a content 
M file. This file is named <sequence id>.SGM and written to the PG Directory in a subdirectory named 
12 <short-ISBN>. ERetrieve also gets associated components and places them in the PG Directory in a 
M30 subdirectory named <short-ISBN>/HiRes. 

The present invention has been described with reference to a particular embodiment in order to 
facilitate the reader's understanding of its broader applications. It shall therefore be understood that the 
scope of the invention is not limited by the above disclosure, but is entitled to the full breadth of the 
claims below. It shall also be appreciated that certain modifications to the methods and designs 
35 disclosed herein may occur to the skilled artisan without departing from the spirit and scope of the 
invention as claimed. 
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CLAIMS 



What is claimed is: 

5 1. A method for facilitating creation and manipulation of compilations of content, 
comprising the step of: 

Providing a functional layer for interfacing with a user interface and a data repository 
containing a plurality of content entities, the functional layer comprising a plurality of function 
10 modules, each module for executing a function pertaining to the creation or manipulation of a 
compilation of content. 

^ 2. The method of claim 1 , wherein one of the function modules creates a list of content 

m entitiy identifiers defining the content and order of a compilation. 

nil 

^15 

H 

^ s 3. The method of claim 2, wherein at least one of the function modules manipulates the list 
O to redefine the content or order of the compilation. 

ii"; ■: 

' 4. The method of claim 1 , wherein the compilation is hierarchically structured and wherein 
CJ20 one of the function modules creates an outline of containers and content entitiy identifiers 
defining the content and hierarchical structure of a compilation. 

5. The method of claim 4, wherein at least one of the function modules manipulates the 
outline to redefine the content or structure of the compilation. 

25 
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6. A program storage device readable by a machine, tangibly embodying a program of 
instructions executable by the machine to perform method steps for facilitating creation and 
manipulation of compilations of content, comprising the step of: 

5 providing a functional layer for interfacing with a user interface and a data repository 

containing a plurality of content entities, the functional layer comprising a plurality of function 
modules, each module for executing a function pertaining to the creation or manipulation of a 
compilation of content. 

10 7. The method of claim 6, wherein one of the function modules creates a list of content 
entitiy identifiers defining the content and order of a compilation. 

. 

"m 8. The method of claim 7, wherein at least one of the function modules manipulates the list 
to redefine the content or order of the compilation. 

I 15 

it 

^ 9. The method of claim 6, wherein the compilation is hierarchically structured and wherein 

one of the function modules creates an outline of containers and content entitiy identifiers 
ly defining the content and hierarchical structure of a compilation. 

^20 10. The method of claim 9, wherein at least one of the function modules manipulates the 
outline to redefine the content or structure of the compilation. 

11. A system for facilitating creation and manipulation of compilations of content, 
comprising: 

25 

a functional layer for interfacing with a user interface and a data repository containing a 
plurality of content entities; 

a plurality of function modules in the functional layer, each module including means for 
30 executing a function pertaining to the creation or manipulation of a compilation of content. 
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12. The system of claim 1 1, wherein one of the function modules further comprises means for 
creating a list of content entitiy identifiers defining the content and order of a compilation. 

13. The system of claim 12, wherein at least one of the function modules further comprises 
means for manipulating the list to redefine the content or order of the compilation. 

14. The system of claim 1 1, wherein the compilation is hierarchically structured and wherein 
one of the function modules further comprises means for creating an outline of containers and 
content entitiy identifiers defining the content and hierarchical structure of a compilation. 

1 5. The system of claim 14, wherein at least one of the function modules further comprises 
means for manipulating the outline to redefine the content or structure of the compilation. 
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PROVIDING A FUNCTIONAL LAYER FOR FACILITATING 
CREATION AND MANIPULATION OF COMPILATIONS OF CONTENT 



ABSTRACT 



A web-based system, method and program product are provided for adding content to a 
content object stored (e.g., a custom compilation or prepublished work) in a data repository as a 
group of hierarchically related content entities. Each noncontainer content object is preferably 
stored as a separate entity in the data repository. Each content entity is also stored as a row in a 
digital library index class as a collection of attributes and references to related content entities 
and containers. As the user selects desired objects for inclusion in a content object, the system 
arranges the objects hierarchically, e.g., into volumes, chapters and sections according to the 
order specified by the user. The system then creates a file object (e.g., a CBO) defining the 
content object that contains a list or outline of the container and noncontainer entities selected, 
their identifiers, order and structure. This file object is stored separately in the data repository. 
An aspect of the invention is the provison of a functional layer between a user interface and the 
data repository for facilitating the creation, manipulation, storage and management of content 
objects in the data repository. 
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Login 



Please enter your Username and Password below. 

If you have not registered with us, please Register now to gain instant 
access to all ESource features. 

Forgotten your Username or Password? Try Remember Me. To change 
your existing profile, use the Change Profile option. Please note that: 

• Login requires valid Username AND valid Password. 

• Remember Me requires valid Username OR valid Password, 

• Change Profile requires valid Username AND valid Password. 
Note: Both usernames and passwords are case sensitive. 



Username 
Password 



r 



{ 



\ot. 
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Register 



Registration, Page 3 of 3 

PJrwe provide some additional information about yourself* The asterisk 
symbol {*} identifies required entries. 

* University ENTER YOUR UNIVE RSITY H ERE j 

* Department _j 



* Email Address 



Your £matt aco*ress is important. All future communications will rely 
on your Email address, including our ability to send you information 
on you custom ojoOucts and use* profile. 



Title 

* First Name 

* Last Name 

* Address Line 1 
Address Line 2 
Address Line 3 

*City 

* State/Province 

* ZIP/Postal Code 

* Country 
Phone 



Piease Seieci 



95141 



United States fi\ 



i 



E 



J 



In order to assist you in remembering your profile in the future please 
provide us with a Secret Word Type (description or hint) and a Secret 
Word, You will be able to sec this category in the future by providing your 
email address or Username. 

* Secret Word Type 



For example: Mothers Maiden Name, City of Birth. 
♦ Secret Word . J 



Secret Word 
(Verify) 



J 



Continue 



11-20 !W .V3J I'M 
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Register 



Thank You for Registering 

We have received your registration information and will allow 
you to access ESource with Guest Access. Please print this 
page for your reference. Your Profile is as follows: 



E-mail: ingridf@us.ibmxom 

Name: Ingrid Foerster 
University; Santa Teresa Labs 
Department: 346 

Address: 

555 Bailey Ave, 



Santa Teresa, CA 95141 
United States 
Phone: 

Secret Word Type: Pet name 
Secret Word: ***** 

You now will be able to view, search, select, and build custom publications. 
Once your registration is validated by Prentice Hail you will automatically 
receive a Full Access password, full Access is necessary to receive an ISBN 
to fulfill an order for any book you build. You may submit an order with 
Prentice Hall before you receive Full Access, but we will need to validate 
your registration before sending you an ISBN for your custom publication. 

To begin using ESource please login with your new Username and 
Password, Please contact your Prentice Hal! Representative if you 
experience any difficulty. 





Ill- 




U-20'19V9 1-51 PM 



komc User 
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Welcome 



Welcome Ms. Ingrid Foerster 

: University j Santa Teresa Labs 
! Department ] 346 



i Access Level ! Guest Access 

i ! This level allows you to create custom products and 



! place orders requests. 



Your Custom Book Outlines -^i«3 



My Custom Book Title 

T 



Last Date 



Status Request ID 



My Now £Sourcc Book 

£&»^.W.t^iVJVW<|i j 12/20/1999 j Active 
16:43:17 1 



N/A 



To create new Custom Book Outline 
click on the red "CBO" button here. 




To start building your outline, click on Ufer$ry to select ESource content for 
your custom book. 

Users who are returning to work on active outlines should select their 
outline from the list above. 

Need more help? Click here far a qu ick Ho w To f or go to fct£!p for more 
extensive information, 

019(38. 1930. 2000 m taw ;ct Hall. I* 
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Welcome Homeboy Fiash Gordon 

Approaching 
Body Building 

Access Level 1 Full Access 

This level allows you to create custom products and 
place orders requests. 



Your Custom Book Outlines ^ ,ri 

Last Date 



My Custom Book Title 

New Custom Book 

Boys don't cry 



V 



What do you mean? 



Flash, I love you 



He'd save everyone 
one of us 



iO/28/2000 



09/02/1999 



Status 


Request ID 


Rejected 


ESOU001051 



Submitted ESOU001057 



Completed 



ESOU002051 « 



09/02/1999 



Completed 



r 



ESOU00105S 



08/19/1999 



i :_j 

Approved ESOU002009 



12 hours to save the 
earth 



Holy Smokes Batman 

ARally Really 
ajlyl^OlLLong 
Long Long Long 
Long. Long Long 
Longl^ngLong 
1 .ona Lona L onu 
LojigJ^ong ! .ong 



r 3 . n 



09/23/1999 ! Completed 



07/23/1999 i Active 



07/23/1999 



ESOU002373 



N/A 



Active 



N/A 
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Your Book Snapshot 



Section*; 4 | P*ge«: 20.0 j PHc«: US$ 2.00 

HI 



To select another Custom Book Outline, 
or to create new Custom Book Outline, 
click on the red "CBO" button here. 




This is Your Custom Book Outline 

Book Title 

My New ESource Book created on 
12/20/1999 at 18:47:54 



Contents 
VOLUME 1 

Table of Contents 

Preface 

Index 



}2S 

Price Pgs.* 



> 130 



J 



0.60 
0.90 
0.50 



6 
9 
5 



TOTAL 



$2.00 20.0 

♦Approximate 



Type the name of a chapter you would like to add 
to this book and press the "Add" button. 
Add new chapter with this title 

I Y ni 



AM 



Select either book or chapter from the drop down 
list, type a new name in the text box and press the 
"Change" button to change the title. 

Change Bex* |rj title to Int ernet Basics} 



T 



as 



I 



GfiMDfi* 



177 



0 3998 199S. 30Q0fwr«Knu tot-. Uc 



Fig. 10 A 
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My Book Outline 
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To select anotli^r Custom Book Outline, 
or to create new Custom Book Outline, 
click on the red M CBO" button here* 



This is Your Custom Book Outline 



m 



Add Your 
Content (IPM) to: 
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Move 
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undo 
Last Action 



$ 



Ctear 
My Book 




Sook Title 7 ^ 

Internet Basics j ^ 

Contents 
VOLUME 1 

Table of Contents 

Preface 

Index 

TOTAL 



Type the name of a chapter you would like to add 
to this book and press the *Add" button. 
Add new thapter with this title 

New Chapter Title \ 



select either book or chapter from the drop down 
55S new name in the text box and press the 
"Change* button to change the title. 
Change J*52*J3 title to ~ 



Price l»9s.* 
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0.90 
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$2-00 20.0 
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> Swch Contents 

► Rudy U*k Book* 



>* 0 Browse Library 



Select a title below to view its table of contents and start 
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Appendix A - Example of Prepublished Content Product Entity Structure 
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PRODUCT.C:01 30808598.00.00.00 
!SKU:0000000014595 
!ISBN:01 30808598 
!Title:Engineering Success 
!Contrib_Group 

! Contrib_First_Name: Peter 

! ContribJLast_Name:Schiavone 

! Contrib_Affiliation:University of Alberta 

!PE_ID:FE 

!Status:0 

!Page_Count:0 

!Use_Actuals:1 

!Yr_of_Pub:1999 

!instr_Only:N 

!Edition:01 

!Revision:00 

!Version:01.00 

!Pub_Med_Type:Paper 

!Primary_Lang:en-US 

!Publisher:Prentice Hall 

!Rts_Country_Of_Origin:USA 

!Rts_Dom_Print_Product:Y 

!Rts_Dom_Electronic_Product:Y 

!Rts_Dom_Custom_Product:Y 
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!Rts_lnt_Print_Product:N 

!Rts_lnt_Electronic_Product:N 

!Rts_lnt_Custom„Product:N 
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!Rts_Reciprocity:N 

!Rts_Attribution:Y 

!Created_By:BARKER 

!LastModified_By:BARKER 

FRONT_AND_BACK_ELEMENT:01 30808598.01 .01 .00 

! Filename:01 30808598.01. 01. OO.sgm 

!CDAOID:AABQHDG0 

!Role:TITLEGROUP 

!Title:Engineering Success 

!SGML„Char_Cnt:19 
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! Contrib_Last_Name:Schiavone 

! Contrib_Affiliation:University of Alberta 

FRONT_AND_BACK_ELEMENT:01 30808598.01 .03.00 

!Filename:01 30808598.01 .03.00.sgm 

!CDAOID:AABQHDI0 

!Role:AFF 

!SGML_Char_Cnt:21 

!Title:Affiliation 

!PE_ID:FE 

!Yr_of_Pub:1999 

!lnstr_Only:N 

! Edition :01 

!Revision:00 

!PubJVIed_Type:Paper 

!Status:0 

!Page__Count:0 

!Created_By:BARKER 

!Contrib_Group 

! Contrib_First_Name:Peter 

! Contrib_LastJ\lame:Schiavone 

! Contrib_Affiliation:University of Alberta 

FRONT_AND_BACK_ELEMENT:01 30808598.01 .04.00 

!Filename:01 30808598.01 .04.00.sgm 

!CDAOID:AABQHDJ0 

!Role:VERBATIMTOC 

!SGML_Char_Cnt:1833 

!Title:Table of Contents 

!PE_ID:FE 

!Yr_of_Pub:1999 

!lnstr_Only:N 

!Edition:01 

!Revision:00 

!Pub_Med_Type:Paper 

!Status:0 

!Page„Count:0 

!Created_By:BARKER 

!Contrib_Group 

! Contrib„First_Name:Peter 
! Contrib_Last_Name:Schiavone 



A-2 



STL000023US1 



! Contrib_Affiliation:University of Alberta 
CHAPTER.C:01 30808598.02.00.00 
!SKU:0000000014601 

!Title:Studying Engineering: The Keys to Success 

!Authored„Abstract:What does it take to be successful in engineering? The good news is that we 
know the answer to this question: Thousands of engineering students have been doing it for years. As a 
freshman engineering student, your biggest advantage lies in the fact that many people have already done 
what you have decided to do, namely, graduate in engineering. To find out what you need to do, you need 
only draw from the experiences of the many successful engineering students that have gone before you. 
That is what this chapter (and most of this book) is about: the tried and tested techniques that will 
guarantee you success in engineering study. 

!Authored_Abstract:The most successful engineering students exhibit common key 
characteristics in their approach to engineering study. The following table lists those characteristics, along 
with actions typically associated with each:CharacteristicActionsCommitment Decide to be successful.Set 
appropriate goals.Stay focused.Stay determined to succeed.Continually remind yourself of the reasonsyou 
chose engineering.ApplicationApply yourself fully to attain your goals. Work hard.StrategyWork 
smart.Maximize effectiveness.Learn the rules and play the game. Perseverance Don’t give up after 
the first, second, or third try.Keep going.Stay focused on your goals;Use power 

thinkinglAssociationsAssociate with people that maintain a positive attitude, people that will help you attain 
your goals.Avoid underachievers and those who do not share your objectives. 

!Authored_Abstract:ln the sections that follow, we discuss each of the preceding characteristics 
and how they will guide you to success as an engineering student. 

!PE_ID:FE 

!Yr„of_Pub:1999 

!lnstr_Only:N 

!Edition:01 

!Revision:00 

!PubJVIed_Type:Paper 

!Status:0 

!Page_Count:0 

!Created_By:BARKER 

!Contrib_Group 

! Contrib_First_Name: Peter 
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! Contrib_Affiliation:University of Alberta 
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!lndex_Term:associations 
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!SGML_Char__Cnt:1979 

!PE_ID:FE 

!Yr„of_Pub:1999 

!lnstr_Only:N 

!Edition:01 

!Revision:00 
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!Status:0 

!Page_Count:0 
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!Created_By:BARKER 
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! Contrib_First„Name:Peter 

! ContribJ_ast_Name:Schiavone 

! Contrib_Affiliation:University of Alberta 
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!lndex_Term:rehearse actual test 

!lndex_Term:examination 

!lndex_Term:technique 

!lndex_Term:time-constrained 

!lndex„Term:exam anxiety 

!lndex_Term:anxiety 

!lndex_Term:exam 

!lndex_Term:test 

! I ndex_Term: working smart, four hints four 

!index_Terim:rules of the game 

!lndex_Term:instructor, what he/she assumes you know 

!lndex_Term:cold-start position 

!lndex_Term:lectures 

!lndex_Term:making the most of 

!lndex_Term:class time, making effective use 

!lndex_Term:textbook as a supplement 

!lndex_Term:effective solutions, writing to all problems 

!lndex__Term:solution 

!lndex_Term:problems, effective writing 

!lndex_Term:clear procedure 

!lndex_Term:detailed 

!lndex_Term:problem solving 

!lndex_Term:procedure for effective 

!lndex_Term:practice, importance of 

!lndex_Term:asking questions, importance of 

!lndex„Term:developing 

!lndex_Term:rehearse 

!lndex_Term:write 

!Title:Strategy 

!SGML_Char_Cnt:10106 
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!Revision:00 

!Pub_Med_Type:Paper 

!Status:0 
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!Created_By:BARKER 
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! Contrib_First_Name:Peter 

! Contrib_Last__Name:Schiavone 

! Contrib„Affiliation:University of Alberta 

SECTION:01 30808598.02.05.00 

!Filename:0130808598.02.05.00.sgm 

!CDAOID:AABQHDO0 

!lndex_Term:engineering, study 

!lndex__Term:as key to success 

!lndex_Term:perseverance 

!lndex_Term:power-thinking zone 

!lndex_Term:ingenuity, increasing with perseverance 

ITitle: Perseverance 

!SGML_Char„Cnt:5632 

!PE_ID:FE 

!Yr_of_Pub:1999 
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!Edition:01 

!Revision:00 

!Pub_Med_Type:Paper 

!Status:0 

!Page_Count:0 

!Created_By:BARKER 

!Contrib„Group 

! Contrib_First_Name:Peter 

! Cont ri b_Last__N ame: Sch iavo ne 

! Contrib„Affiliation:University of Alberta 

SECTION:01 30808598.02.06.00 

! Filename:01 30808598.02.06.00.sgm 

!CDAOID:AABQHDP0 

!lndex„Term:associations 
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!lndex_Term:as key to success 

!Title:Associations 
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!Edition:01 

!Revision:00 

!Pub_Med_Type:Paper 

!Status:0 
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! Contrib_Affiliation:University of Alberta 

FRONT_AND„BACK_ELEMENT:01 30808598.02.07.00 
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! C D AO I D : AABQ H DQ0 

!Role:CH.END 

ITitle: Problems 

!lndex„Term:engineering, study 
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!SGML_Char_Cnt:4503 
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!lnstr_Only:N 

!Edition:01 

!Revision:00 

!Pub_Med_Type:Paper 

!Status:0 
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ITitle: Introduction to Engineering and Engineering Study 



A-6 



STL000023US1 



!Authored_Abstract:“How much do you know about engineering? Why did you choose to 
study engineering?What reasons lead you to believe that you are ready and equipped to study 
engineering?What are the main differences between studying at a university and studying in high 
school?What new success skills do you need to succeed in engineering study?Can you write down 10 
answers to each question I have asked you? Go ahead and try.” 

!Authored_Abstract:This is often how I begin my lecture to freshman engineering students 
enrolled in an introductory engineering class. After a little thought, most of them realize just how little they 
know about this subject called engineering and (often despite excellent high school averages) how ill 
equipped they are to study engineering. 

!Authored_Abstract:ln this chapter, we address both issues. First, we ask the following 
questions:What is engineering?What do engineers do?Why choose to study engineering? 

!Authored_Abstract:The answers to these questions are not only interesting and informative, but 
will help keep you motivated along the long, hard road to an engineering degree. Ability and hard work 
might get you through the initial stages, but after that, you must have a driving force, something that will 
sustain you through the hard times. You must develop a powerful motivation. The best way to do this is to 
learn as much as possible about the rewards of an engineering degree. Perhaps write them out and pin 
them on your wall or paste them inside your calculus book. Keep them close at hand. They will keep you 
determined and strong. This is exactly what the most successful engineering students do; they remain 
focused by keeping in mind the reasons they chose engineering and the rewards associated with entering 
the engineering profession. Make it a priority to keep learning about engineering, so that you will become 
aware of all the opportunities and rewards as they arise throughout your course of study. Thi 
s will fuel your motivation and your desire to succeed. The more important it becomes for you to graduate, 
the more likely you are to do so. 

!Authored_Abstract:ln , we address the question, “Are you prepared and equipped for 
engineering study?” In doing so, we examine the study skills required to succeed in the university 
environment. For many students, the university is the next logical step after high school, the next 
academic challenge. Consequently, they expect their freshman year in engineering to be much like 
another year of high school-which, of course, it isn’t. In engineering, such an exception often 
manifests itself in unacceptabiy high first-year attrition rates. We address this issue by focusing on what 
you need to do to ensure the best possible start to earning your engineering degree. Essentially, you must 
develop the necessary:Work strategiesStudy strategiesAttitudesCommunication skillsAbility to work as 
part of a teamTime management skills 

!PE_ID:FE 
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! Contrib__First_Name:Peter 

! ContribJ_ast_Name:Schiavone 
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!Filename:01 30808598.03.01 .OO.sgm 
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!lndex_Term:engineering, study 

!lndex_Term:introduction to 
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!lndex_Term:attitudes 

!lndex_Term:skills 

!Index_Term:communication 
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!lndex_Term:time management 
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!lndex_Term:engineering 
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!lndex_Term:engineering, study 
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! ACFORMID:2 

! NUMBERAC:1 

!Associated_Component 

! AC_PE_ID:FE 

! AC_CDAOID:AABQHDT0 

! AC_Title:F!G1 

! AC_I mage_Type:TI F F 

! AC_Graphic_Filename:HiRes\AABQHDTO.TIF 

! ACJmage„RoIe:Display 
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!PEJD:FE 
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!lnstr_Only:N 
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!Filename:0130808598.03.03.00.sgm 
!CDAOID:AABQHDU0 
!lndex_Term:engineers 
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!lndex_Term:engineering, study 
!lndex_Term:introduction to 
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!lndex_Term: mechanical engineering 
!lndex_Term:engineering 
!lndex_Term:mechanical 
!lndex_Term:engineering specialization 
!lndex_Term:electrical engineering 
!lndex_Term:electrical 
!lndex_Term:civil engineering 
!lndex_Term:civil 

!lndex_Term:chemical engineering 
!lndex_Term:chemical 
!lndex_Term:industrial engineering 
!lndex_Term:industrial 
!lndex_Term:engineering, types of 
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!lndex_Term:devices 
!lndex_Term:vibrations of structures 
!lndex_Term:manufacturing 
!index_Term:processes 
!lndex_Term:energy conversion systems 
!lndex_Term:heating systems 
!lndex_Term:systems 
!lndex_Term:heating 
!index_Term:ventilation systems 
!lndex_Term:ventilation 
!lndex_Term:air-conditioning systems 
!lndex_Term:air-conditioning 
!lndex_Term:solid mechanics 
!lndex_Term:fluid mechanics 
!lndex__Term:thermodynamics 
!lndex_Term:mechanical design 
!lndex_Term:energy 
!lndex„Term:ceilular phones 
!lndex_J"erm:fax machines 
!lndex_Term:electric power engineering 
!lndex_Term:electric 
!lndex_Term:electric power 
!lndex_Term:communications 
!lndex_Term:control systems 
!lndex_Term:digital system 
!index_Term:digital systems 
!lndex_Term:electronics 
!lndex_Term:communications engineering 
!lndex_Term:wires 
!lndex_Term:coaxial cable 
!lndex_Term:fiber optics 
!lndex_Term:radio 
!lndex_Term:design 
!lndex_Term:digital circuits 
!lndex_Term:digital circuits design 
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!lndex_Term:analog circuit design 
!index_Term:signal processing 
!lndex_Term:control systems engineering 
!lndex_Term:control automated processes 
!lndex_Term:robotics 
!lndex_Term:oiI 

!lndex„Term;and gas extraction 
!lndex_Term:aerospace 
!lndex_Term:industry 
!lndex_Term:power systems 
!lndex_Term:automobile manufacturing 
!lndex_Term:household appliances 
!lndex_Term:digital system engineering 
!lndex_Term:digital electronics 
!lndex_Term:digital signal processing 
!lndex_Term:computers 
!lndex„Term:instrumentation 
!lndex„Term:circuits 
!lndex_Term:signa!s 
!lndex„Term:solid-state devices 
!lndex_Term:integrated circuits 
!lndex„Term:semiconductors 
!lndex„Term:computer aided design 
!lndex__Term:microwave systems 
!lndex_Term:personal computers 
!lndex_Term:personal stereo equupment 
!lndex_Term:airports 
!lndex_Term:building 
!lndex_Term:bridges 
!lndex_Term:harbors 
!lndex_Term:highways 
!lndex_Term:transit systems 
!lndex_Term:offshore drilling platforms 
!lndex_Term:waste 
!lndex_Term:collection structures 
!lndex_Term:water 
!lndex_Term:supply facilities 
!lndex_Term:public works 
!lndex_Term:construction engineering 
!lndex_Term:construction 
!lndex„Term:environmental engineering 
!lndex_Term:environmental 
!lndex_Term:geotechnical engineering 
!lndex_„Term:geotechnical 
!lndex„Term:structural engineering 
! I ndex_Term:structu rai 
!lndex_Term:surveying 
!lndex_Term:transportation 
!lndex„Term:resources engineering 
!lndex__Term:water resources 
!lndex_Term:architects 
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!lndex_Term:professional engineers 
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!lndex_Term:cleanups at sea 

!lndex_Term:spills 

!lndex_J"erm:earthquakes 

! I ndex_Term: landslid es 

!index_Term:groundwater seepage 

!lndex_Term:slopes 

!lndexj"erm:settlement of buildings 

!lndex_Term:underground facilities 

!lndex_Term:pavements 

! I ndex_Term:structures 

!lndex_Term:soils 

!lndex_Term:rock 

!lndex_Term:containment facilities 

!lndex_Term:high towers 

!lndex_Term:drilling platforms 
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!lndex_Term:aerial 
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! I ndex_Term: harbor development 
! I ndex„Term: hydroelectric power 
!lndex_Term:drainage 
!lndex_Term:irrigation 

! I ndex_Term: wastewater collection systems 
!lndex_Term:distribution systems 
!lndex_Term:flood control 
!lndex_Term:water resources engineering 
!lndex_Term:food 
!lndex_Term:products 
!lndex„Term:electricity 
!lndex_Term:gas 
!lndex„Term:plastics 
!!ndex__Term:materials 
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!lndex_Term:forestry 
!lndex_Term:processing 
!lndex_Term:petrochemical 
!lndex_Term:polymer engineering 
!lndex„Term:polymer 
!lndex_Term:biotechnology 
!lndex_Term:process control engineering 
!lndex_Term:process control 
!lndex_Term:management 
!lndex_Term:engineering management 
!lndex_Term:and natural gas 
!lndex_Term:composites 
!lndex_Term:films 
!index_Term:fibres 
!lndex„Term:rubbers 
!lndex_Term:polymer molecules 
!lndex_Term:fertilizers 
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!Authored_Abstract:Over the last 10 years, I have taught many different engineering courses at 
various levels, from an introductory level to a more advanced graduate level. At the end of each course, I 
approach the most successful students and ask them to describe their study or work habits and any 
special techniques they may have used to achieve that particular level of success. Year after year, the 
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high school curriculum. Recently, however, weaknesses in the secondary education system have meant 
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prerequisite works.Effective note taking.Making effective use of the course textbook.How to be effective 
on assignments. Using posted solutions to assignments.Using tutors and study guides. 
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!Title:How to Be Successful on Examinations 

!Authored_Abstract:Examinations invariably make up the single largest contribution to your final 
grade in any engineering course. 

!Authored_Abstract:This simple fact explains why many engineering students are focused on 
examinations (quizzes, midterms, and finals), rather than on an appreciation of the course material. Like it 
or not, your performance on examinations will more or less determine how well you do in your engineering 
courses. For this reason, it is essential to understand why the very best engineering students are so 
successful in examinations and to learn how to use this information to your advantage. 

!Authored_Abstract:So why do certain students perform so much better than others on 
examinations? Some students put it down to a simple matter of intelligence: “Oh, that girl is really 
smart. Her father is a physics teacher and her mother has a Ph.D. No wonder she scores over 90 percent 
on all hertests.” Others put it down to the lack of a social life: “That guy never goes out. He 
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does nothing but study. No wonder he performs so well on testsi” I suggest that neither is entirely 
correct and that the truth lies somewhere in between. At senior high school and junior college levels, 
intelligence alone is no longer sufficient to place someone in the top 5 of the class. There is far too much 
material to absorb and not enough time in which to absorb it-even if you devote all your time to studying. In 
fact, it has been my experience that the very best students have an extremely active social life. Indeed 
their level of success often increases with their level of activity. Achieving success 
has more to do with how you prepare for an examination and what you do to prepare. 
!Authored__Abstract:l learned early on in my academic career that 
!Authored_Abstract:knowing the course material ≠ success in course examinations. 
!Authored_Abstract:l recall my midterm examination from my first engineering mechanics course. 
I had worked consistently throughout the year, understanding the course material, doing every 
assignment, and working through extra practice problems (in much the same way as I did when I was in 
high school). I understood the main ideas and concepts, and I was able to apply them in different 
situations. So I was quite confident that I would do well on the examination. Imagine my surprise when I 
discovered that I had scored only 58 percent! Worse than that, many of those students scoring above me 
had performed poorly throughout the course having missed assignments and often asking me for help. I 
couldn’t understand why this happened. I had worked hard and I knew the material, so why 
wasn’t I performing to the best of my ability? I began to discover the answer to my question when I 
asked one of my classmates (who had the highest score on the test) how she had prepared for the 
examination. It became clear 

to me that there were some missing ingredients in my test preparation routine. Basically, it came down to 
two things: 

!Authored_Abstract:Smart practiceExamination technique 

!Authored_Abstract:My friend and I had both prepared well during the course. What made the 
difference in our midterm scores was what we did in preparing for the test itself. She had obviously 
regarded the test as a separate entity, targeting and tailoring all her efforts not solely toward reviewing the 
course material (as I did), but toward doing well on the test itself (smart practice). She had obtained many 
former and practice midterm examinations and rehearsed her performance, so that she had a much better 
idea of what was expected and how to demonstrate the required knowledge under a time constraint 
(examination technique). She was entirely focused on doing well on the examination. I, on the other hand, 
was focused on the course material, believing that to be sufficient to perform to the best of my ability on 
the midterm. 

!Authored_Abstract:To understand why my friend’s strategy was so much more effective 
than mine, let’s return to the car-driving analogy. None of us believe that we can pass a standard 
driving test simply by driving the way we do in everyday life. We recognize that a driving test requires us to 
demonstrate a distinct collection of maneuvers and exercises, based on basic driving skills, under 
examination conditions. Conversely, no one continues to drive the way they did during their driving test. 
The latter is a rehearsed performance, requiring specific targeted practice based on a knowledge of exact 
requirements (smart practice) and a focused effort to perform well under specific test conditions 
(examination technique). Consequently, in preparing for a driving test, we find out as much as we can 
about what is required and target our preparation (as effectively as possible) toward those particular goals. 
Exactly the same principles apply to preparing for any test, academic or otherwise. 

!Authored_Abstract:ln fact, since that first mechanics midterm, an acknowledgment of these basic 
principles has allowed me to perform to the best of my abilities on all subsequent examinations, such as 
academic examinations (in many different disciplines), driving tests, athletics competitions, or whatever 
requires me to demonstrate performance under a given set of constraints. 

!Authored_Abstract:ln this chapter, we discuss, in detail, the many different aspects of maximizing 
performance on examinations, including, in particular, the two main ingredients: smart practice and 
examination technique. 
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example, my colleague David Steigmann is a professor of mechanical engineering at the University of 
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!Title:Developing Engineering Skills 

iAuthored_Abstract:Excellent technical engineering skills are, in themselves, not sufficient to 
guarantee a successful career in engineering. The following is a list of skills that employers are constantly 
demanding.The ability to communicate effectively, including-The ability to write clear, coherent technical 
documents.-The ability to present well-researched, well-organized seminars confidently and 
competently The ability to work effectively in teams with people from different backgrounds/The ability to 
demonstrate creativity in all aspects of the profession/The ability to manage personnel and resources in 
8 engineering projects.The ability to keep up to date with developments during one’s years as a 

professional engineer. . . ,.„■,.* ■ 

iAuthored_Abstract:These skills are highly prized by employers. Acquiring such skills will not only 
give you the edge in employability, but also allow you to go well beyond your training as an engineer 
towards an ever-expanding number of exciting and challenging opportunities. 

iAuthored_Abstract:What can you do to give yourself a head start and acquire the foregoing skills 
before you graduate? We have already discussed how teamwork and independent learning skills Chapter 
4) ->can contribute to success in engineering study. Practicing these skills as an undergraduate 
engineering student is an excellent way to equip yourself beforehand with at least three of them:The ability 
to work effectively in teams with people from different backgrounds. Your experience with teamwork as an 
undergraduate engineering student will make this skill second nature to you by the time you graduate. Not 
only will you know how to work as part of a team, but you will have the ability to organize and manage 
teams The ability to keep up to date with developments in engineering. As an engineering student, you 
become well accustomed to learning and thinking independently. The ability to keep up to date with 
developments in engineering requires that you find and assimilate information independently, as requir 
ed This is exactly what you do as a resourceful, committed undergraduate engineering student.The ability 
to manage personnel and resources in engineering projects. Practicing teamwork and independent 
learning together means that you learn to manage not only your own resources (time, energy, money, 
etc ), but also the resources of those around you. You learn how to find information effectively, how to use 
that information to attain a defined goal, and how to communicate the information to other group 

members. , . . . . . 

iAuthored_Abstract:ln this chapter, we concern ourselves with the remaining two engineering 
skills:The ability to communicate effectively, includingThe ability to write clear, coherent technical 
documents/The ability to present well-researched, well-organized seminars confidently and 
competently.The ability to demonstrate creativity in all aspects of the profession. 
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